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Preface 


The demand for high-speed circuits and systems has steadily increased 
over time. Led initially by automotive safety (radar), silicon-germanium 
(SiGe) heterojunctiom bipolar transistors (HBTs) and their combination with 
complementary metal-oxide-semiconductor (CMOS) transistors into highly 
integrated BiCMOS process technologies have become a very attractive 
solution for a plethora of present and prospective applications operating at 
frequencies from 30 GHz up to several 100 GHz (so-called mm-waves). Such 
applications range from communications (presently driven by 5G) to imaging 
in public transportation (security), medicine and biology, just to name 
a few. Requiring device performance beyond CMOS, above applications 
were traditionally placed into the realm of III-V technologies. With the 
goal of providing highly integrated and cost effective mm-wave electronic 
systems employing high-speed SiGe HBT front-end circuitry integrated 
with high density CMOS digital processing capability, the two European 
Commission funded joint research projects DOTFIVE (2007-2010) and 
DOTSEVEN (2013-2016) were instrumental in making SiGe HBTs and 
BiCMOS technology competitive and attractive for the above mentioned 
applications. 

DOTSEVEN was a highly successful project, which not only lifted SiGe 
HBT performance to an unprecedented level but also provided the theoretical 
understanding of this new HBT technology as well as the demonstration of its 
capabilities. Key to the project’s success was the excellent cooperation within 
the consortium consisting of partners from the different and complementary 
areas of process engineering, device modeling and circuit design. This book 
summarizes the important results of DOTSEVEN in more detail than the 
many associated publications and thus addresses not only expert readers 
familiar with the technology but also students and others who like to learn 
more about SiGe HBT technology or need a concise overview in one of 
the associated research areas. In each chapter, the research described is 
motivated and put in perspective by an introduction that provides sufficient 
background and additional literature for supporting the understanding. 


al 


xii Preface 


Not covered though are fundamentals of device physics, process technology 
and circuit design. Here, the reader is referred to the standard text books in 
each area. 

This book is organized as follows. The Introduction provides the 
motivation for pursuing HBTs, in particular those based on SiGe, and for 
setting-up the DOTSEVEN project. Its organization and a brief description 
of the various work packages are presented, which mirror the relevance of 
the respective topics for the project. Here, also the most important results are 
briefly summarized in a single place. 

Chapter 1 “SiGe HBT Technology” first introduces the relevant metrics 
used to evaluate transistor performance. Then, those HBT device and process 
architectures are explored that have been pursued within DOTSEVEN. It 
is shown why the conventional double-polysilicon self-aligned selective 
epitaxial growth approach limits the simultaneous increase of transit and 
power gain cut-off frequency. The solutions overcoming these issues and 
leading to the new vertical and lateral architecture developed within 
DOTSEVEN is described and documented by experimental results. 

Chapter 2 “Device Simulation” addresses different types of numerical 
simulation and visualizes the results through many examples. First, the 
simulation of isothermal (in terms of the device temperature) carrier transport 
is discussed. In particular, deterministic solutions for the Boltzmann transport 
equation and the trade-offs necessary for applying computationally more 
efficient simulation approaches for device optimization are described as they 
were pursued within the project. The second part covers electro-thermal 
simulation based on coupling the impact of phonon scattering and 
corresponding self-heating with carrier transport simulation. The resulting 
simulated temperature increase is used to develop methods for accurately 
determining the thermal resistance from the simulated DC characteristics. 
Third, hot-carrier effects in advanced SiGe HBTs are investigated employing 
microscopic simulation. 

Chapter 3 “SiGe HBT Compact Modeling” starts with an overview on the 
standard HBT compact model HICUM Level 2. Afterwards, the most recent 
extensions related to DOTSEVEN, which were mostly targeted towards 
the intrinsic device operation are described. This is followed by a detailed 
discussion of the corresponding parameter extraction methods, including 
bias, geometry and temperature dependence. Geometry scaling and modeling 
of the intra-device substrate coupling concludes this chapter. 

Chapter 4 “(Sub)mm-wave Calibration” addresses the electrical high- 
frequency on-wafer characterization of the fabricated HBTs, which 
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becomes inaccurate beyond 220 GHz when using traditional methods. The 
conventional calibration and deembedding techniques are reviewed, followed 
by a discussion of the signal propagation modes in transmission lines on 
lossy substrates as they are encountered on a silicon wafer. Direct on-wafer 
calibration up to the device-under-test ports is then introduced, using a 
special transmission line configuration. As a demonstration, a comparison 
of measurements using this technique with HICUM is shown for frequencies 
up to 325 GHz. 

Chapter 5 “Reliability” reports on the medium- and long-term 
degradation of the fabricated advanced HBTs due to hot carrier stress. 
The measured impact of hot carriers generated during device operation 
on DC and low-frequency noise characteristics is shown and compared to 
the results of a correspondingly extended HICUM version. Furthermore, 
self-heating is investigated, which is becoming increasingly important in 
downscaled high-performance devices and plays an important role in device 
degradation. A method for determining the thermal resistance is described 
and corroborated on thermal simulation data. Finally, different analytical 
models for describing the thermal resistance as a function of geometry are 
compared. 

Chapter 6 “Millimeter-wave Circuits and Applications” is divided into 
three parts. First, simple basic building blocks termed benchmark circuits 
are considered which serve mostly for verifying the compact models in a 
circuit environment, but can also be used for benchmarking the technology 
performance. For each of the selected examples a sensitivity study reveals 
the most important transistor parameters. The second part reports on larger 
circuit building blocks as they would be used in a system. Examples are 
given that demonstrate competitive high-frequency circuit operation down 
to 0.5 V supply voltage and thus very low power consumption. The third 
part describes the three mm-wave demonstrator systems and their building 
blocks that were realized with DOTSEVEN technology, namely a 240 GHz 
Transceiver for ultra-high data rate wireless communication, a 210-270 GHz 
circularly polarized radar, and a 0.5 THz computer tomography system. The 
latter is the first-ever purely silicon based tomography system operating at 
such frequency. 

Chapter 7 “Future of SiGe HBT Technology and Its Applications” puts 
the DOTSEVEN results in perspective by comparing, based on the standard 
performance metrics, the different technology options for mm-wave and 
future sub-mm-wave applications. A more intriguing perspective is provided 
when looking at the transistor operation within circuits, where the load 
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comprised of parasitics from connections to other devices and the latter 
themselves paints a different picture of transistor speed as given by standard 
metrics. In view of the ITRS/IRDS roadmap for SiGe HBTs, which grew out 
of the modeling effort in DOTFIVE and DOTSEVEN, it is shown that HBTs 
in general have a bright future for realizing high-frequency systems. Their 
near future is discussed for different (sub-)mm-wave application areas based 
on DOTSEVEN results as a reference. 

We hope that this book is a useful reference for a wide range of readers 
interested in mm- and sub-mm-wave technology, devices, and applications. 


Naples and Dresden/San Diego Niccolo Rinaldi 
February 2018 Michael Schroter 


Acknowledgements 


This work was supported by the European Commission under the contract 
No. 316755-DOTSEVEN. Fruitful collaboration with all partners of the 
DOTSEVEN consortium is gratefully acknowledged. In particular, the 
authors gratefully acknowledge the numerous contributions of the HBT 
development teams of IHP and Infineon with special thanks to: R. Barth, 
A. Fox, J. Korn, T. Lenke, S. Marschmeyer, A. Scheit, D. Schmidt, and 
D. Wolansky of IHP, and J. Bock, S. Boguth, K. Knapp, R. Lachner, 
W. Liebl, D. Manger, T. F. Meister, and A. Pribil of Infineon. We thank 
W. Skorupa and T. Schumann of Helmholtz-Zentrum Dresden-Rossendorf 
and S. Haberlein of FHR Anlagenbau GmbH for their support with 
millisecond-annealing capabilities. The authors of chapter 3 like to thank 
P. Sakalas for his measurement support. The authors of chapter 4 would 
also like to thank the European Metrology Programme for Innovation and 
Research (EMPIR) Project 14INDO2 “Microwave measurements for planar 
circuits and components”, for partially supporting the reported work. The 
EMPIR program is co-financed by the participating countries and from the 
European Unions Horizon 2020 research and innovation program. 

We, the editors, greatly appreciate the effort of our (co-)authors to 
contribute the various chapters and to spend a certainly significant portion of 
their busy schedule to finally bring this book to completion. We would also 
like to thank Mark deJong for enabling the publication of this book and Junko 
Nagajima who tirelessly worked through several iterations of corrections for 
assembling the diverse contributions into a homogeneous final version. 


XV 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


List of Contributors 


Anindya Mukherjee, Chair for Electron Devices and Integrated Circuits, 
Technische Universität Dresden, Helmholtzstr, 18, Barkhausenbau, 01062 
Dresden, Germany 


Andreas Pawlak, Chair for Electron Devices and Integrated Circuits, 
Technische Universität Dresden, Helmholtzstr, 18, Barkhausenbau, 01062 
Dresden, Germany 


Alessandro Magnani, Department of Electrical Engineering and Information 
Technology, University Federico I, via Claudio 80125, Naples, Italy 


Bertrand Ardouin, XMOD Technologies, 74 rue G. Bonnac, 3300 Bordeaux, 
France 


Bernd Heinemann, JHP, Im Technologiepark 25, 15236 Frankfurt (Oder), 
Germany 


Christoph Jungemann, RWTH Aachen University, 52056 Aachen, Germany 


Cristell Maneux, Laboratory of Integration of Material to System (IMS), 
University of Bordeaux, Bordeaux, France 


Gerald Wedel, Chair for Electron Devices and Integrated Circuits, 
Technische Universität Dresden, Helmholtzstr, 18, Barkhausenbau, 01062 
Dresden, Germany 


Gerhard G. Fischer, JHP, Frankfurt (Oder), Germany 


Holger Rücker, JHP, Im Technologiepark 25, 15236 Frankfurt (Oder), 
Germany 


xvii 


xviii List of Contributors 


Janusz Grzyb, Institute for High-Frequency and Communication Technology, 
University of Wuppertal, Wuppertal, Germany 


Klaus Aufinger, Infineon Technologies AG, Neubiberg, Germany 


Luca Galatro, /. Electronic Research Laboratory, Delft University of 
Technology, Mekelweg 4, 2628CD, Delft, The Netherlands 
2. Vertigo Technologies B.V., Mekelweg 4, 2628CD, Delft, The Netherlands 


Michael Schröter, 7. Chair for Electron Devices and Integrated Circuits, 
Technische Universität Dresden, Helmholtzstr, 18, Barkhausenbau, 01062 
Dresden, Germany 

2. Department of Electrical and Computer Engineering, University of 
California at San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0407, USA 


Marco Spirito, Electronic Research Laboratory, Delft University of 
Technology, Mekelweg 4, 2628CD, Delft, The Netherlands 


Niccolo Rinaldi, Department of Electrical Engineering and Information 
Technology, University Federico II, via Claudio 80125, Naples, Italy 


Philipp Hillger, Institute for High-Frequency and Communication Technology, 
University of Wuppertal, Wuppertal, Germany 


Ritesh Jain, Institute for High-Frequency and Communication Technology, 
University of Wuppertal, Wuppertal, Germany 


Salvatore Russo, Department of Electrical Engineering and Information 
Technology, University Federico II, via Claudio 80125, Naples, Italy 


Ullrich Pfeiffer, Chair for the Institute for High-Frequency and 
Communication Technology (IHCT), Universiry of Wuppertal, Wuppertal, 
Germany 


Vincenzo d’Alessandro, Department of Electrical Engineering and 
Information Technology, University Federico II, via Claudio 80125, Naples, 
Italy 


Wenfeng Liang, Chair for Electron Devices and Integrated Circuits, 
Technische Universität Dresden, Helmholtzstr, 18, Barkhausenbau, 01062 
Dresden, Germany 


Figure 1 


Figure 2 


Figure 1.1 


Figure 1.2 


Figure 1.3 


Figure 1.4 


Figure 1.5 


Figure 1.6 
Figure 1.7 
Figure 1.8 


Figure 1.9 


List of Figures 


List of DOTSEVEN project partners and their 
home countries. oo ai a ae e O E a E a a 
Location of the THz frequency range within 

the electromagnetic spectrum. The overlap region 
(THz-gap) between electronic and photonic 
approaches around 1 THz is indicated. ....... 
Peak fr and fmax values of high-speed SiGe HBT 
technologies. Red diamonds indicate results 

of the DOTSEVEN project.............. 
Schematic cross section of a high speed SiGe 

HB Vein Be ee BE ee OR ACE BO aed 
Device cross section with parasitic resistances 

and capacitances associated with different device 
regions (a) and a corresponding small signal 
equivalent circuit (b). . . 2... .......004. 
Schematic vertical doping profile of a SiGe HBT. 
The dashed lines indicate a scaled profile 

for enhanced fi, aora mei a ee o E a a a 
Device cross section with lateral device dimensions 
and major contributions of the base-link region 
toRpandCpc. .... 2... T aoa Aa 
Schematic cross sections of conventional 
DPSA-SEG process flow. .............. 
TEM cross section of a DPSA-SEG HBT 

of Infineon’s B11HFC technology. ......... 
Different base link configurations of selective 
epitaxial growth (SEG) HBTs. ........... 
Schematic cross sections of the EBL process flow 
after emitter structuring (a) and after selective 
growth of the EBL (b). ............... 


xix 


14 


14 


23 


xx List of Figures 


Figure 1.10 


Figure 1.11 


Figure 1.12 


Figure 1.13 


Figure 1.14 


Figure 1.15 


Figure 1.16 


Figure 1.17 


Figure 1.18 


Figure 1.19 


TEM cross section of IHP’s EBL-HBT module 

on Infineon’s 130 nm platform for the bipolar-only 

run (left) and full BiCMOS flow (right), ...... 29 
Transit frequency f'r and maximum oscillation 

frequency fmax vs. the collector current density 

Jc for Infineon/THP EBL fabrication in a bipolar- 

only process and in a BiCMOS run vs. the IHP 


EBL reference. ...........-0.000006 31 
Schematic cross section of an NSEG HBT 
with elevated extrinsic base regions. ........ 34 


Process sequence for the NSEG HBT with elevated 
base regions: (a) after SIC formation, (b) after 
non-selective growth of the base, (c) before 


emitter deposition, (d) after emitter structuring. .. 34 
TEM cross section of an NSEG HBT of the 
technology SG13G2. ................ 35 


fr and fmax vs. collector current density for an 

HBT with BEC layout configuration. Eight HBTs 

in parallel with individual emitter areas 

of 0.17 x 1.01 wm? were measured. ........ 37 
Layout configurations: (a) BEC configuration with 

base and collector contacts at the ends of the 

emitter line, (b) CBEBC configuration with base 

and collector contact rows parallel to the emitter 

Hiena beat ee a GO he Dae eh whe taa a a 37 
(a) Depth profiles of Ge (blue circles) and B (red 

squares) measured by SIMS. Open symbols 

are as-grown profiles. Filled symbols are profiles 

after the full fabrication process. (b) Ge depth 

profiles measured by EDX in a 600 um x 400 pm 
window (blue) and in a typical HBT structure 

(Eteen) kidman dee aah ol hee ey alee ted 38 
SIMS profiles measured after the final annealing 

step. The theoretically proposed doping profile N3 

of is shown for comparison. . ............ 39 
Measured and simulated fr vs. collector current 

density. Simulations were performed with 

the hydrodynamic model in 2D and 1D. Results 


Figure 1.20 


Figure 1.21 


Figure 1.22 


Figure 1.23 


Figure 1.24 


Figure 1.25 


List of Figures 


obtained from the 1D Boltzmann transport equation 
are shown for comparison. The measured device 
has an emitter area of 0.28 um x 5.0 um and the 


CBEBC layout corresponding to Figure 1.16(b). . . . 


TEM cross sections of HBTs from the process splits 
G2 (a), CR2 (b), and D7s(c).. 2... ee 
Transit frequency f'r and maximum oscillation 
frequency fmax vs. collector current density 

for devices of the split CR2 (second circuit 
fabrication run in DOTSEVEN) compared 

to the reference process G2. Device dimensions 

are given in Table 1.3................. 
Transit frequency f'r and maximum oscillation 
frequency fmax vs. collector current density for 
devices of the split G2N (nickel silicide) and G2NF 
(nickel silicide and flash annealing) compared to the 
reference process G2. Device dimensions 

are given in Table 1.3................. 
De-embedded small-signal current gain hg, and 
unilateral gain U vs. frequency of the device D7s 
used for extraction of transit frequency fr 

and maximum oscillation frequency fmax with 
—20 dB decay per frequency decade. 

The emitter area is 8 x (0.105 x 1.0) wm?. 

Transit frequency f'r and maximum oscillation 
frequency fmax vs. collector current density 

for two device geometries (D7 and D7s) 

of the latest process status of DOTSEVEN 
compared to the reference process G2 and the 

split CR2. Device dimensions are given in 
Tablecd3% te Vr gots Mt BAe God hha dk Sd 
CML ring oscillator gate delays vs. current 

per gate for oscillators consisting of 31 stages 

with single-emitter HBTs for the splits G2 (Ag = 
0.12 wm x 1.02 um), CR2 (Ag =0.1 wm x 1.0 um), 
D7, and D7s (Ag = 0.105 um x 1.02 wm)... . .. 


XX1 


40 


42 


43 


44 


45 


46 


xxii List of Figures 


Figure 1.26 


Figure 2.1 
Figure 2.2 


Figure 2.3 


Figure 2.4 


Figure 2.5 


Figure 2.6 


Figure 2.7 


Figure 2.8 


Figure 2.9 


Figure 2.10 


Gummel characteristics (a) and base-current forced 
output characteristics (b) for the splits D7s and G2. 
Symbols in (b) indicate the bias points for peak fr. 
The emitter areas are 8 x (0.12 x 1.02) um? for G2 
and 8 x (0.105 x 1.0) wm? forD7s......... 
Flowchart for finding the vertical and lateral HBT 
structure of a major technology node......... 
Net doping and Germanium profile of a SiGe HBT 
with fp =630 GHz... .......2..-000.. 
Transfer characteristic (left) and transit frequency 
(right) obtained from DD transport and BTE for the 
device of Figure 2.2. Vog = 1V........... 
Electron velocity and density obtained by DD and 
BTE simulation... ..............02. 
Illustration of the impact of ftt on the transfer 
characteristic, transit frequency and output 
characteristics for fo = f° = 0... nnna aan 
Illustration of the impact of ft° on the transfer 
characteristic, transit frequency and output 
characteristics at ft? = f= 0. .......... 
Exemplary illustration of the impact of 

f° (ftd= fte = 0) on the transfer characteristic, 
transit frequency and output characteristic... ... 
Transfer characteristic and transit frequency of 
the SiGe HBT in Figure 2.2 obtained from HD 
simulation with adjusted HD transport model 
parameters and SDevice defaults [Syn15], compared 
to BTE and DD simulation results. ......... 
(a) Net doping and Ge profile of the SiGe HBT 
given in [Sch11] and the corresponding (b) transfer 
characteristic and (c) transit frequency obtained 
from BTE and HD simulation. ........... 
(a) Band edges and Ge profile, (b) valley occupancy, 
and (c) electron density of the SiGe HBT shown 
in Figure 2.9 obtained by BTE simulation at the 
operating point of peak transit frequency... .... 


47 


56 


66 


67 


67 


68 


69 


70 


71 


72 


Figure 2.11 


Figure 2.12 


Figure 2.13 


Figure 2.14 


Figure 2.15 


Figure 2.16 


Figure 2.17 


Figure 2.18 


List of Figures 


(a) Band edges and graded Ge profile as well as the 
corresponding (b) valley occupancy and (c) electron 
density obtained by BTE simulation at the operating 
point of peak transit frequency. ........... 
(a) Comparison of (a) the total 1D capacitance 
connected to the base node and its components and 
(b) the transconductance, obtained for the initial 
(abrupt) and the new (graded) SiGe HBT 


(a) Comparison of the initial and the new SiGe HBT 
profile with corresponding (b) transfer characteristic 
and (c) transit frequency, obtained by both BTE and 
HD simulation. a sare oe Bom ea a eua E 
(a) Doping and Germanium profile of a SiGe HBT 
and the corresponding (b) transfer characteristic and 
(c) transit frequency obtained from BTE and HD 
SIMULAN i ge a a ga Gy Salaa AE a Ee 
Transconductance and total capacitance of the N3 
SiGe HBT ites. Soe me ed a E de a 
Conduction band edge versus location and 
superimposed contour lines of the (logarithm of the) 
electron distribution function of the 4-fold A-valley 
over energy within the emitter-base region for three 
operating points: (a) around fT BTE,pk/2, 

(b) just before ft BTE, pk. and (c) and just 

after ÍT, BTE,pk- BIR ie ab ats Ria A iby ys po NAS 
Electron distribution function within the 

4-fold A-valley just below fT BTE,pk within the 
emitter-base region. (a) Contours with the arrow 
marking the position of the doping induced 
conduction band barrier. (b) Comparison of HD and 
BTE distribution function at the barrier. ...... 
Comparison of (a) the transconductances and 

(b) the total capacitance obtained by DD, HD 

and BTE simulation results... ........... 


xxiii 


77 


xxiv List of Figures 


Figure 2.19 


Figure 2.20 


Figure 2.21 


Figure 2.22 


Figure 2.23 


Figure 2.24 


Figure 2.25 


Comparison of the BTE and HD electron densities 
obtained by (a) an DC and (b) an quasi-static 
analysis. In (b), also the quasi-static hole densities 
are shown. For (b), a different axis intercept is 

used compared to (a) in order to visualize the 
contributions to the emitter junction 

capacitance Cki- aa tie Bee wel oat, a the Da aae 
Comparison of Cp; obtained by BTE and HD 
simuUlations...6-5.5: 4.8 aie aoa a heel Bat aca Seah dice 
(a) Doping profile with smoothed high to low 
transition in the emitter (new) and previous 
step-like profile (ini.). Corresponding terminal 
characteristics: (a) transfer current and (b) transit 
frequency. he ee RH ee eee 
Electron distribution function within the 4-fold 
A-valley at fT BTE,pk Within the emitter-base 
region. (a) Contours with the arrow marking the 
position of the doping induced conduction band 
barrier. (b) Comparison of HD and BTE distribution 
function at the barrier peak... ........... 
(a) Doping and Ge profile of a SiGe HBT 
(fabricated by IHP) with corresponding (b) transfer 
current, and (c) transit frequency. Comparison HD 
and BTE simulation results with 1D measurement 
data eed riae se’ erie Rake a raneliy ites 
Comparison of the Ge concentration induced 
bandgap narrowing from experimental data (exp.) 
and device simulation model (mod.). In addition, 
the lower and upper boundary (lb and ub) for 
bandgap narrowing as function of Ge the presence 
of metastable strain is shown............. 
Comparison of the 1D measurement data with DD, 
HD and BTE simulation results. For the DD and HD 
simulation, the linear model indicated in Figure 2.24 
is used. The transit frequency obtained by DD is not 
shown, since its parameters have not been adjusted. 


81 


84 


Figure 2.26 


Figure 2.27 


Figure 2.28 


Figure 2.29 


Figure 2.30 


Figure 2.31 


Figure 2.32 


Figure 2.33 


Figure 2.34 


Figure 2.35 


List of Figures 


Comparison of the performance trends predicted 
by TCAD (HD and BTE simulation) with 1D 
measurement results (three samples) for a process 
split with four different SiGe HBTs (fabricated 
BVHE); ice hon te & eee E ay a al BoP ae aed 
Thermal energy transport diagram in semiconductor 
CEVICES sae Le tee ah Soe: Bort eid dod 
(Top) Thermal conductivity in the 2-D SiGe HBT 
structure by taking into account the effect of 

Ge content, doping concentration, and boundary 
scattering at 300 K. (Bottom) Self-consistent lattice 
temperature at Vgg = 0.9 V and Vcg = 1 V. 

Profiles of the power densities received and 
dissipated by carriers which are calculated from 
Joule-heating and energy loss rate due to inelastic 
phonon scattering, respectively, along the symmetry 
axis of the HBT at Vg = 0.9 V and Vcg = 1 V. 
(Top) The LO phonon distribution function (zeroth- 
order harmonic), and (bottom) lattice temperature 
and effective temperature for LO phonons, along the 
symmetry axis of the investigated HBT at Vpp = 0.9 
V and Vor fF) DN Gee ae aR SE eer 
(Top) Ic — Vee characteristics for different 
homogeneous temperatures at Vog = 0.6 V. Solid 
lines show the isothermal simulation results at Tp 
= 300, 320, 340, 360 K and symbols show the 
corresponding measurement data. (Bottom) Ver — 
Vcg characteristics from electrothermal simulation 
and measurement at Ig =2 MA. .......... 
Ic — Ver characteristics with and without including 
self-heating compared to measurement data at 

Vom EI Vee kk Oo be ele elas tad ohn te 
(Left) Creation of Si dangling bonds at the Si/SiO2 
interface. (Right) Passivation of the dangling bonds 
by incorporating hydrogen atoms. ......... 
Schematic of a state-of-the-art SiGe HBT with the 
corresponding EB spacer and STI oxides. ..... 
The energy configuration of the Si-H bond modeled 
as a truncated harmonic oscillator. ......... 


XXV 


85 


86 


90 


91 


92 


93 


94 


xxvi List of Figures 


Figure 2.36 


Figure 2.37 


Figure 2.38 


Figure 2.39 


Figure 2.40 


Figure 3.1 


Figure 3.2 


Figure 3.3 


II generation rates in the SiGe HBT induced by 
electrons (top) and holes (bottom) at P3. ...... 
Cut of EDFs [eV~! cm~] for electrons (top) and 
holes (bottom) along the symmetry axis of the HBT 
ERI ar acs ante Digs GP ae Bak ae Be tesla a, aire 
(Top) EDFs of electrons (dashed lines) and holes 
(solid lines) at the intersection of the EB spacer 
oxide interface and the EB junction [denoted by 
node X in Figure 2.36 (Bottom)]. Profiles of the 
AB Als for electrons (dashed lines) and holes (solid 
lines) along the EB spacer oxide interface from node 
A to C denoted in Figure 2.36. ........... 
Interface trap densities generated at different stress 
time steps from node A to C denoted in Figure 2.36 
AU Pi ooh. cease ten ol Sek cae pihertener E A ined 
(Top) Gummel characteristics (Vcg = 0 V) of the 
fresh and degraded SiGe HBT after 1,000 h at P3 
obtained from simulation (lines) and measurement 
(symbols). (Bottom) Excess base currents over the 
stress time obtained from simulation (lines) and 
measurement (symbols) at Vag = 0.67 V and 

Veb = OV is sino OSES Re Bi ees 
Equivalent circuit of HICUM/L2 including the 
adjunct networks for modeling electro-thermal 
effects and NQS effects. Not shown are the 
networks for modeling correlated noise. The 
dash-dotted line defines the intrinsic (1D) transistor 
representation and the dashed line defines the 
internal transistor... 1... 0... ..2. 0000. 
Spatial dependence of the weight functions (a) hy 
and hy for Jc = 5mA /um?, and (b) hg (solid line) 
for low injection. In both pictures the dotted line 
shows the 1D doping profile in log-scale. In (b), the 
dashed line shows the bandgap in linear scale. . . . 
(a) Transfer current for transistors with different Ge 
profiles in the base. (b) Transfer current normalized 
to their ideal formulation for the same transistors as 
in (a) at room temperature and Vpq» =OV..... 


100 


104 


117 


120 


121 


Figure 3.4 


Figure 3.5 


Figure 3.6 


Figure 3.7 


Figure 3.8 


Figure 3.9 


Figure 3.10 


List of Figures 


Visualization of the depletion charge in the 
base-emitter space charge region for two bias points 
with increasing voltage from the top picture to 

the bottom. The intrinsic carrier density is given 

in log-scale. xjg relates to the metallurgic BE 
junction, while xe and xeo, respectively, are the 
boundaries of the space charge region for 

Vg > OV and Vp pg = OV, respectively. ; 
Application of the model Equation (3.19) for the 
weight factor obtained to transistors with different 
shapes of the Ge profile... . o ooa 
Spatial dependence of the electron density 
normalized to ne from (3.11) in the neutral base, 
marked by the vertical dashed lines, for different 
values of the field factor ¢. The x-axis is normalized 
to the metallurgical base width wgm with the BE 
junction located at x = 0... aoaaa aaa 
(a) Determination of the voltage drop in the neutral 
emitter (AVe) and in the BE SCR (AV pg) from the 
quasi-fermi potential of the electrons for transistors 
with different Ge profiles in the base. The dashed 
lines show the begin and end of the BE SCR. (b) 
Current dependence of the voltage drops; AV gg is 
shown for bias points only up to the beginning of 
high current effects... ............... 
(a) Spatial dependence of the hole density for 
transistors with different Ge profiles. The Ge 
content is given by the dotted lines for the box (left) 
and the graded (right) profile. The vertical dashed 
lines are the same as in Figure 3.7(b) Minority 
charge as a function of collector current density in 


the BE SCR for transistors with different Ge profile. 


Current dependence of the weight factors for the 
charge stored in the neutral emitter and BE SCR for 
the transistors with different Ge profiles. ...... 
Bias dependence of hfoọ extracted from 1D 
device simulations and application of the model 
equation (3:23)... eo dw VE a has Bel a eu aes 


xxvii 


123 


125 


126 


127 


127 


128 


129 


xxviii List of Figures 


Figure 3.11 


Figure 3.12 


Figure 3.13 


Figure 3.14 


Figure 3.15 


Figure 3.16 


Figure 3.17 


Figure 3.18 


Application of the model equations (3.29) and 


(3.32) to hjzio(T) and hfo(T) as well as Equation (3.31) 


to aņjgi obtained from 1D device simulations. . . . 
Application of (3.33) to the high current weight 
factors hrg and hrc obtained from 1D device 

simulations. .. 2... 2... 2 ee ee 
(a) Impact of 6, on the voltage dependence of IcK. 
Solid lines show the actual [cx while dashed lines 
show the results for the low-voltage portion of [i.e., 
the first term of (3.34), neglecting punch-through]. 
(b) Impact of ajcKpt on Ick for a very small 

punch-through voltage. . . 2... .......0.. 
Spatial dependence of the conductance band edge 
for low and high current densities (solid lines), 
highlighting the presence of the barrier at high 
current densities. The dashed line shows the doping 
profile of the transistor just for reference... .... 
Modeling of the current dependence of the 
heterojunction barrier voltage for different voltages 
Vom V = [0.3; 0.6; 0.9; 1.2; 1.5]. oak Ra 
Transit times of a SiGe HBT showing the BC barrier 
effect: comparison of Equations (3.39) and (3.40) 
with results from 1D device simulation for different 
voltages Vær /V = [0.3; 0.5; 0.8; 1.0; 1.2; 1.5]. .. 
Sketch of the cross section for (a) a junction isolated 
(with partial trench-isolation) and (b) a deep trench 
isolated collector including all relevant elements 
of the most simple equivalent circuit for modeling 
intra-device substrate coupling. Note that for all 
series resistance a parallel capacitance exists due 
to the permittivity of the substrate and that due to 
changes of the substrate-collector SCR all depletion 
capacitances Cjs(a,p) and series resistances Rgu,(a,p) 
are bias dependent. .. . ooo 
Impact of intra-device substrate coupling on the 
dynamic output-conductance given by the real part 
of Yə2. Solid lines show the actual values including 
substrate coupling while the dashed lines show 
results with an ideally open substrate. ....... 


131 


133 


136 


Figure 3.19 
Figure 3.20 


Figure 3.21 


Figure 3.22 


Figure 3.23 
Figure 3.24 
Figure 3.25 
Figure 3.26 


Figure 3.27 


Figure 3.28 


Figure 3.29 


Figure 3.30 


Figure 3.31 


List of Figures 


Substrate coupling equivalent circuit in HICUM/L2. 


Illustration of the base series resistance components 
and their relation to the HBT cross section 
(Schematic). 6) 2. atc ce SAR A 
Typical test structure (a.k.a. contact chain) used for 
determining the specific electrical resistivities of the 
external base resistance components. B1, B2, B3 
designates the contacts... ............. 
Internal base sheet resistance, normalized to its 
zero-bias value rgpig, as extracted from tetrodes for 
different bias conditions and technologies. .... . 
Extraction of hjg; based on (3.48). 
Extraction results for dpjzi based on (3.53) for 
different starting values fj. ............. 
Extraction of hjp; based on (3.54) including the 
correction based on the extracted apjpi.. . . 
Extraction results and application of the temperature 
model for aņjri (3.31) and hjgio (3.29)... 2... 
Extracted values for hy from (3.56) for (a) 
room temperature and different Vgc. (b) Extracted 
values chosen at low current densities for different 
temperatures and Vgc. 
Extraction results for the bias-dependent hto at room 
temperature for the results given in Figure 3.27(b). 
(a) Extracted hep values at room temperature 

for different Vgc as indicated by the arrow. 

(b) Extracted temperature dependence. ....... 
(a) Extracted values for htc at room temperature for 
different Vgc as indicated by the arrow. 

(b) Extracted temperature dependence. ....... 
The collector current flow in the emitter can be 
split into an intrinsic portion related to the emitter 
area and a portion related to the perimeter only. 
The right picture shows the spatial distribution of 
the vertical electron current density. Marked are the 
actual emitter width bgo as well as the effective 
emitter width bp, calculated from bgo and yc (cf. 
3.58, 3.60). 


XX1X 


139 


142 


142 


149 


151 


152 


152 


xxx List of Figures 


Figure 3.32 


Figure 3.33 


Figure 3.34 


Figure 3.35 


Figure 4.1 


Figure 4.2 


Figure 4.3 


Schematic illustration of the effective emitter area 
concept. The area and perimeter currents are 
gathered in a single effective contribution. ..... 
Experimental data of Ic/Ago versus Pgo/Aro for 
Ver = 0.45-0.5 V in steps of 10 mV. Ago and 
Pgo, respectively, are the actual emitter window 
area and perimeter, respectively. The drawn emitter 
dimensions are (0.31, 0.35, 0.4, 0.53, 0.7, 1.2, 2.2) x 
10 um?. Symbols represent measured data and 
dashed lines results from linear regression... .. . 
Schematic illustration of the generalized effective 
emitter area concept. ................ 
Required matrix of test structures for scalable 
parameter extraction in the case of the generalized 
scaling laws: 3.43 ove dA Be Ak eas Sot SE 
Cross section of a coplanar wave guide (CPW) with 
finite ground planes, and sketches of the E field 
distributions of the first propagating modes 
supported: Aa ran ss, BO ae, Be ere hd 
Cross section of CPW placed (a) on metal chuck, 
(b) on absorber. Electrical field intensity below the 
CPW metal plates (5 um) for case (c) no absorber 
and (d) with absorbing boundary conditions in the 
3D FEM simulation, both fields were computed at 
80 GHZ 3,3 yas sae EA eee ie eae a a e 
(a) Simulated S21 of CPW on alumina substrate for 
various cases: CPW no GAP sub = 0 um 

GAP = 0 um, CPW GAP subl sub = 320 um 
GAP = 500 wm, CPW GAP sub2 sub = 420 wm 
GAP = 500 um; (b) CPW structure used in the 
EM simulator with highlight on the lumped bridge 
configuration; (c) measurement of different (4) thru 
lines on alumina substrate in different locations of 
the calibration substrate. Locations (i.e., two 
middle and two center) identified in the inset on 

TOP TIEN, sho gases ce Ge eh ol a cae gh ah a POG hee Ce 


155 


156 


156 


157 


164 


166 


Figure 4.4 


Figure 4.5 


Figure 4.6 


Figure 4.7 


Figure 4.8 


Figure 4.9 


Figure 4.10 


List of Figures 


(a) Simulated S21 of CPW on fused silica substrate 
for various cases: CPW no GAP sub = 0 pm 

GAP = 0 um, CPW GAP subl sub = 320 wm 
GAP = 500 um, CPW GAP sub2 sub = 420 pm 
GAP = 500 um; (b) measurement versus 
simulation of a thru line on fused silica substrate. 
Schematic representation of the capacitive coupling 
between the probe tip and the substrate where the 
device under test (DUT) is embedded. ....... 
Worst case error bound for calibration transfer from 
fused silica and alumina to SiGe BEOL, before 
correction (full symbols, solid lines) and after 
correction (empty symbols, dotted lines), obtained 
with on-wafer measurements on a 600 um CPW 
line manufactured on IHP SiGe 130 nm BiCMOS 
technology, in the frequency range from 75 to 

TIO GHZS (i: Beene oe oA, oe an ee a Ae ae 
Coplanar wave guide CPW calibration structures 
realized on IHP SiGe 130 nm BiCMOS technology. 
(a) Microphotograph of the thru line, (b) of the 
reflect standard and (c) of the transmission line 
employed for the WROS calibration kit, 

(d) schematic cross section of the CPW line... .. 
Field distribution on waveguide ports at 300 GHz 
when exciting the structures described in Figure 4.7, 
for (a) Keysight EMPro and (b) Ansoft HFSS. .. . 
Real part of characteristic impedance for the 

line shown in Figure 4.7(a), computed with the 
simulation approach described in (solid lines 
EMPro, dashed lines HFSS, dashed-dot lines CST), 
and measured using the method of (empty circles) 
and the method of (filled squares). ......... 
Comparison of probe-tips corrected measurements 
of a verification line manufactured on the SiGe 
BEOL in the frequency range 75-325 GHz 

for different calibrations. .............. 


XXX1 


168 


173 


xxxii List of Figures 


Figure 4.11 


Figure 4.12 


Figure 4.13 


Figure 4.14 


Figure 4.15 


Figure 5.1 


Figure 5.2 


Simplified schematic top-view of a generic 
test-structure realized with CL-ICPW. (a) Input 
section, (b) M7-M1 vertical transition and (c) DUT 
Stades a2. dt deni we Se eee OG dead 
Schematic cross section of the input stage used for 
the test structures (a). 3D model of the vertical 
transition connecting the central conductor of the 
input stage in M7 to the CL-ICPW central conductor 
in M1 (b). Schematic cross section of the CL-ICPW 
employed in the DUT stage of the calibration 
KIC) ee Boe tide ae EOC A Oh A 
Micrograph of the de-embedding kit on Infineon 
BI1HFC technology. ................ 
Top view of the transistor (EBCEB) integrated into 
the test-structure (a) Detailed view of the layout 
for the integrated transistor (b), highlighting the 
input and output fixture (in yellow) required to 
guarantee connection to the intrinsic device. The 
base, collector, and emitter contact (B, C, and E, 
respectively) are marked on the layout........ 
S-parameter measurements (dotted lines) versus 
model of the considered Infineon transistor (solid 
lines) for both a) Amplitude (in dB) and b) Phase 
(in degrees). . o eee ere ee eee 
CB output characteristics of the DUT manufactured 
by IHP; also shown are the pinch-in locus (red 
dashed line) and the stress paths A,B,C. ..... 
Relative base current degradation of the IHP 
HBT(s) as a function of stress time (a) for 

JE stress = 0.12 mA/ jum? and various VGB stress 
(series A), and (b) for Vop stress = 2.75 V and 
various JF, stress (series C). Also shown is extraction 
of exponent a at short and long stress times for 
Selected cases. ona a hos away ak ela ale ad 


177 


Figure 5.3 


Figure 5.4 


Figure 5.5 


Figure 5.6 


Figure 5.7 


Figure 5.8 


Figure 5.9 


Figure 5.10 


Figure 5.11 


Figure 5.12 


List of Figures xxxiii 


Relative base current reduction vs. anneal time 
obtained by applying Tp = 398 K, VCB anneal = 1.5 V, 

and JE anneal = 30 mA/ um? to IHP HBTs previously 
stressed with the bias conditions of Figure 5.2(a) 

(also reported in the legend). The monitoring of 

the base current was performed in situ, i.e., at 

Tp = 398 K for Vgg =0.6 Vand Vcg =0V. ... 190 
Relative base current degradation of the IFX 

device(s) vs. stress time for Vop stress = 2 V and 

VariOUS Jp stress) siei deei ya ea e ee 191 
Relative base current degradation of the IFX 

DUT(s) against stress time for JE stress = 

1.4 mA/um? and various Vcpstress) < <- -à 191 
Relative base current degradation of the IFX 

device(s) as a function of stress time for JE stress = 

1.4 mA/um? and (a) Vopstress = 2.5 V, (b) 
VoB,stress = 2.75 V. Also shown is the extraction of 
exponent a at short and medium times... ..... 192 
(a) Avalanche current density J ay and (b) 

base-emitter junction temperature Tj as a function 

of emitter current density Jg for various Vocps. 

Also shown are the conditions corresponding to the 

stress tests reported in Figures 5.4 and 5.5 (the same 
symbols were used for the sake of clarity). ..... 193 
Output characteristics of the SiGe HBT under test 
simulated using HICUM/L2. Also represented are 

the examined bias conditions (P1, P2, P3, and P23). 194 
Monitoring forward Gummel plots for the DUT 
stressed at P1, P2, and P3. ............. 196 
Evolution of the excess base current as a function of 

stress time for six identical HBTs tested at P1, six 
HBTs tested at P2, and six HBTs tested at P3. .. . 197 
Evolution of the forward Gummel plot with stress 
(aging) time at P23. Shown in the inset is the relative 
variation of Ig at Vgg = 0.65 V. .......2... 198 
Evolution of the reverse Gummel plot with stress 
(aging) time at P23. Shown in the inset is the relative 
variation of Ig at Vgc =0.5 V... aaa aaa 199 


Figure 5.13 


Figure 5.14 


Figure 5.15 


Figure 5.16 


Figure 5.17 


Figure 5.18 


Figure 5.19 


Figure 5.20 


Figure 5.21 


Figure 5.22 


Figure 5.23 


Figure 5.24 


Figure 5.25 


xxxiv List of Figures 


Physical origin of the base current degradation 
represented in a cross section within a TCAD 
environment. 
Forward Gummel plots at Vcg = 0 V at P3 after 
(a) 7 h and (b) 750 h of stress. Measurement results 
(symbols) are compared with the simulated (solid) 
counterparts. 
Trap density evolution along the interface of the 
emitter—base oxide spacer vs. stress time at 

P2 and P3. 
Forward Gummel plot showing the bias range for 
noise measurements. 
Sy, showing different G-R mechanisms and their 
corresponding RTS in time domain for 
transistor#1. 2... 2... 2.2.0... -- 0000.4 
Sy, showing different G-R mechanisms at different 
bias (Vgg) conditions and their corresponding RTS 
in time domain for transistor #1. .......... 
Sy, showing different G-R mechanisms at different 
bias (Vgg) conditions and their corresponding RTS 
in time domain for transistor #7. .......... 
(a) Base RTS at different bias conditions, (b) 
corresponding time constants for the low and the 
high states as a function of bias (Vp) for 
transistor‘ #4) gees ete eR ape Pee a a 
Base noise RTS amplitude (AJ) as a function of 
bias (Ig) for different transistor geometries. 

(a) Collector RTS at different bias conditions and 
(b) corresponding RTS time constants as a function 
of bias (Vcp) for transistors #4 and #5. 
(a) Forward Gummel plots and (b) base current 
degradation of IHP HBTs simulated with an 
empirical aging function (dashed lines). 
New transistor circuit used for aging law 
implementation in HICUM. 
Thermal resistance (Rr) as a function of emitter 
length (Lg) for various emitter widths (Wg), as 
experimentally determined for sets (a) #1, (b) #2, 
and (C) #3. 6 ok on a Seo gis She hn cain 


208 


Figure 5.26 


Figure 5.27 


Figure 5.28 


Figure 5.29 


Figure 6.1 


Figure 6.2 


Figure 6.3 


Figure 6.4 


Figure 6.5 


List of Figures XXXV 


Detail of the 3-D Comsol mesh for the IFX 
transistor with Ap = 0.13 x 2.73 um?, composed 
of 1.35 million tetrahedra of grossly different 
dimensions, corresponding to 1.8 million degrees of 
freedom: Tori x eles a ee ane he ek BS St 219 
Schematic representation (limited to the innermost 
tungsten contacts) of the typical cross section of the 
IEX DULS mopa eae Be oe ee a Re Aas 220 
Thermal resistances as a function of emitter 
width for IFX devices sharing Lg = 2.73 um: 
experimental (squares) values are compared with 
those calculated through the simulation approaches 
A (circles), B (triangles), C (flipped triangles), D 
(rhombi), and E (left-oriented triangles). ...... 222 
Comparison between scalable models (5.24) (dashed 
lines), (5.25) (dotted), (5.32) (solid) with calibrated 
parameters, and experimental Rrus (symbols) for 


set #2 transistors. . . 0... a 227 
Schematic of the broadband Darlington amplifier 
(a) without and (b) with peaking inductor (Lp). . . 237 


Comparison between measured data (symbols) and 
compact model HICUM/L72 results (lines). (a) 
Transit frequency fr vs. Jc for different Vgc 
values. (B) Transconductance gm vs. frequency for 

Vgc = OV and at different Jc = (1, 5, 10, 20) 

MÄI os We eS hk Syl a Dae ea Mg se 238 
Die photograph of the BBA (a) with and (b) without 
peaking inductor. The chip size in both cases is 

(0.245 x 0.18) mm?. 6 eed ar thoes Does 239 
(a) Small-signal gain and (b) stability factor of 

the BBA with and without peaking inductor: 

comparison between simulation (lines) and 

measurement (symbols). .............. 239 
Sensitivity of the series peaked BBA performance 
parameters with respect to HICUM model parameters 

for the transistors (a) Q1 and (b) Q2. ........ 240 


xxxvi List of Figures 


Figure 6.6 


Figure 6.7 


Figure 6.8 


Figure 6.9 


Figure 6.10 


Figure 6.11 


Figure 6.12 
Figure 6.13 


Figure 6.14 


Figure 6.15 


(a) LNA with z-network for input matching, (b) 
small signal equivalent circuit of the LNA with 
feedback resistance Rfg and input matching 

elements. dr e reny ae ee ee a ee 241 
NF and NF yin versus Jc for a SiGe HBT with 

Ago = 0.7 x 0.9 um? for Vcg = 1.2 V at f = 90 


GHZ bd end teas ie Me Bi eee thetic a Meck xs 242 
Schematic of the single-stage wide-band (90-110 
GHZ), ENAS oaa e obo Oo ene A doe geen eo 242 


(a) Small-signal results of the wide-band LNA: 
comparison between measurement (dashed lines) 
and simulation with HICUM/L2 (solid lines). (b) 
Corresponding frequency-dependent noise figure 
NF: comparison between measurements (symbols) 
and simulation of (blue line) and NF min (red dashed 


Sensitivity of the wideband LNA performance 
parameters with respect to HICUM model parameters 

for the transistors (a) QI and (b) Q2. ........ 244 
(a) Transit frequency of an HBT transistor from 

IHP SG13G2 versus its collector current density 

with Vgc varying among -0.5, 0, and 0.5 V; (b) 
Transfer characteristics of an HBT transistor from 

IHP SG13G2 versus its base-emitter DC voltage 

with Vgc varying among —0.5, 0, and 0.5 V. .... 247 
Schematic of the three-stage common-emitter LNA. 248 
Constant available power gain circles (blue curves, 

from 6.15 dB to 5.35 dB with a 0.2 dB step) and 
constant NF circles (red curves, from 3.48 dB to 

4.28 dB with a 0.2 dB step) for the first stage of 

the amplifier at94GHz................ 249 
Measured (symbols) and simulated (lines) results of 

the W-band ultra-low power three-stage LNA: (a) 
S-parameters and (b) noise figure. ......... 249 
(a) Absolute sensitivity of LNA FoMs w.r.t. to series 
resistance variation. Detailed variation of (b) S21, 

(c) minimum noise figure, (d) input referred 

third-order intercept point, all w.r.t. to 

series resistance variation. ............. 250 


Figure 6.16 
Figure 6.17 


Figure 6.18 


Figure 6.19 


Figure 6.20 


Figure 6.21 


Figure 6.22 


Figure 6.23 


Figure 6.24 


Figure 6.25 


Figure 6.26 


List of Figures 


Schematic of the W-band low-power frequency 
triplet. i.e SAP eee Ba A we a 
Layout and corresponding EM simulation views of 
the frequency tripler. 
(a) Input and output return losses of the core part of 
the frequency tripler; (b) Conversion gain (actually 
loss) of the frequency tripler. ............ 
Block diagram of 240 GHz quadrature Tx (a) and 
Rx (b) chipset with identical on-chip ring 

antenna. 
Block diagram of LO signal source consisting 
x16 frequency multiplication over four cascaded 
frequency doublers (D1-D4) and a wideband 
three-stage PA. 
Schematic of the Gilbert-cell doubler stage for 
frequency multiplication. The table mentions the 
passive values and the lengths J, and le for 
microstrip lines used to implement inductors Lp and 
Le, respectively. For D1 and D2, the base tuning 
inductance. ss es o e e e a a a a 
Interstage matching between the doubler stages of 
the multiplier chain for LO generation. The tuning 
inductance Le connected to the collector output 
is not shown, after. Other parameter values are 
mentioned in the table in Figure 6.21. 
Simulated: (a) output power and (b) CG of the 
individual doubler stages. The doublers D1 and D3 
are tuned higher, while D2 and D4 are tuned lower, 
and this resulted in an overall flat response. 
Simulated impedance at the collector outputs of 
D1-D4 derived from the large signal S-parameter 
simulations. . . ooa 
Schematic of the three-stage PA. The transistors 
Q1-Q4 have an emitter area of 8 x (0.96 x 0.12) 
PM asa ge ane te oe le ai ee Sa i 
On-chip power measurements for the standalone LO 
generation source with an Erickson calorimeter for 
input LO power of -10dBm andOQdBm....... 


xxxvii 


252 


253 


261 


xxxviii 


Figure 6.27 


Figure 6.28 


Figure 6.29 


Figure 6.30 


Figure 6.31 


Figure 6.32 
Figure 6.33 


Figure 6.34 


Figure 6.35 


Figure 6.36 


Figure 6.37 


Figure 6.38 


List of Figures 


Simplified metal-level multi-layer geometry for the 
differential quadrature coupler. ........... 
Simulated input match at all four ports of the 
quadrature coupler and isolation between the input 
ports for a differential excitation. All ports are 
referred to a 100-2 differential impedance... .. . 
Up-conversion mixer schematic with additional 
buffer stages for wideband 50-Q IF matching. .. . 
Schematic for the down-conversion mixer. Here, 
additional buffer stages were added to have 

50-Q input impedance required for wideband IF 
matching. 
Schematic of the low-pass filter implemented on a 
ROGERS 4350B PCB material with a thickness of 
0.338 mm. The stepped impedance filter low-pass 
filter is implemented with microstrip lines on the 
PCB. 
The full-EM simulation results from the filter. . . . 
Lens mounted and packaged chip for Tx/Rx 
moditle:. 20. A ee a 
Chip-micrograph of the Tx and Rx chipset. The total 
chip area including the pads is (a) Tx: 1.613 mm? 
(b) Rx: 1.522 mm?. For the on-wafer measurements, 
an auxiliary balun with an estimated 2.5 dB loss has 
been added at the output. . . 2... 2... Le 
On-chip characterization results for (a) the Tx, and 
(b) the: RX. sts a we me A dn tha 
Measurement results from the IF bandwidth 
characterization over a link distance of 90 cm. For 
this measurement, the LO is fixed at 240 GHz and 
the measured 6 dB IF bandwidth is 13 GHz. 
Measurement setup for the high data rate wireless 
communication with arbitrary waveform generator 
(Tektronix AWG70001A) and real-time oscilloscope 
(Tektronix DPO77002SX). ...........2. 
Measured eye diagrams for: 25 Gbps BPSK 
modulation (left); 40 Gbps QPSK modulation 


(ISD sae ee ade a ee ae ne hee 


265 


265 


266 


268 


270 
271 


271 


272 


272 


273 


273 


Figure 6.39 


Figure 6.40 


Figure 6.41 


Figure 6.42 


Figure 6.43 


Figure 6.44 


Figure 6.45 
Figure 6.46 


Figure 6.47 


List of Figures 


Radar transceiver chip implemented in 0.13-um 
SiGe HBT technology and operating at 210-270 
GHz: (a) chip micrograph, (b) block diagram; after. 
The chip size is 2.9 mm x 1.lmm.......... 
Complete radar transceiver module with a copper 
heat sink and a 9 mm silicon lens. 

The incorporated IR-image indicates that the 
chip-on-lens assembly is at around 29°C... .... 
A 3-D EM simulation model of the packaged 
radar module with a silicon chip mounted on the 
back of a 9-mm lens. The chip-on-lens assembly 
is placed inside a rectangular recess in the PCB 
and surrounded by a large ground plane. The slot 
antenna with the differential quadrature coupler in 
the BEOL dielectric stack of a silicon chip is shown 
in the magnified view. The transmit port and the 
receive port are denoted as ‘Tx out’ and ‘Rx in’, 
respectively... cess se . eee ee 
Simulated return loss at the TX port and the 
TX-to-RX leakage for the complete chip-on-lens 
packaged assembly from Figure 6.41. ....... 
Azimuthal view of the antenna co-polar radiation 
pattern at 270 GHz for the radar module operating 
inthe transmit mode. ................ 
Frequency-dependent radiated power; data from. 
Both the total power and the power levels for 
two orthogonal antenna orientations (‘A-plane’ and 
‘B-plane’) are plotted. For plane orientation, please, 
refer to Figure 6.39...............0.. 
Noise figure and conversion gain of the radar 
module for an IF frequency of 33 MHz. ...... 
Architecture of the complete radar under test with a 
metallic plate located atadistanceR......... 
Phase noise of the frequency chirp generator at 15 
GHz driven from the frequency synthesizer Agilent 
E8257D at 1 GHz; after. A frequency of 15 GHz 
corresponds to 240 GHz for the up-converted signal 
at the radar RF output. ............... 


XXX1X 


xl 


List of Figures 


Figure 6.48 


Figure 6.49 


Figure 6.50 


Figure 6.51 


Figure 6.52 


Normalized frequency-dependent power received 
from the calibrating metallic plate after the 
Hilbert-transformed time-domain IF calibration 


Radar response to a metallic plate located at 
a distance of 60 cm from the radar module; 
data from. (a) Magnitude response after amplitude 
calibration only, (b) instantaneous beat frequency 
after the amplitude and phase corrections. The beat 
frequency de-embedded from the peak in the IF 
power spectrum of the return signal is 124.1 kHz 
(see also Figure 6.50). Both frequency and time 
units are shown due to duality of the sweep time 
and the actual RF frequency for a linear 

BMCW fadar- n a bg he ek Bae ahh Gd PE 
The FFT-computed IF spectrum of the calibrated 
beat signal corresponding to the metallic plate 
spaced by 60 cm from the radar module for two 
different operational RF bandwidths: (a) 60 GHz 
and (b) 45 GHz. For comparison purposes, the chirp 
duration was varied for both bandwidths to arrive at 
the same beat frequency of 124.1 kHz. For 60 GHz, 
it was set to 2 ms whereas for 45 GHz it was reduced 
accordingly. The influence of harmonic spurs can be 
identified around 109 kHz and 140 kHz. ...... 
2-D optical scanning test setup for demonstration 
of the radar 3-D imaging capabilities. A total path 
length between the radar module and the object is 
around 780mm. ..............-..-4.4 
3-D imaging experiment with the implemented 
radar module. (a) Cardboard box with a blister pack 
of drugs with two missing tablets as the scanned 
object. (b, c) 2-D scan of the normalized power 
received for an object-to-radar distance of 780 mm 
altogether with the range profiles for two different 
X-Y positions across the cardboard. The positions 
correspond to the present and the missing tablet, 
respectively. Both acquired range profiles show a 
DR of around 50 dB. ................ 


List of Figures xli 


Figure 6.53 3-D surface reconstruction of the object from Figure 
6.52(a) after a peak-search algorithm. The scan was 
appropriately range-gated (770 mm < Z < 788 mm) 
to eliminate the influence of multi-path reflections 
inside the cardboard box and the plastic cavities of 
the blister pack as well as reflections from the front 
Of thé: box. Yen Say AE Be Bb ed 291 

Figure 6.54 Illustration of the THz-CT scanner. The system 
comprises a 490 GHz SiGe-HBT source, an NMOS 
detector, and an optical train based on four f# = 2, 
50 mm PTFE-lenses. The object is rotated () and 


stepped in the 2D object plane (yz). ........ 292 
Figure 6.55 Photograph of the THz-CT scanner. ........ 292 
Figure 6.56 Schematic (a) and micrograph (b) of the 490 GHz 

SiGe-HBT radiator. ..............0.. 293 


Figure 6.57 A 3-dimensional EM simulation model of the 
packaged source module with a silicon chip 
mounted on the back of a 4 mm hyper-hemispherical 


Silicon-lens:, «0 inte A ed a E a a 295 
Figure 6.58 Measured output power and DC-to-RF efficiency 
versus frequency. .............---4.4 295 


Figure 6.59 Measured voltage responsivity and NEP for 
different gate bias voltages and micrograph of 
the detector. The modification of the source/drain 
extension masks shifts the bias point for optimum 
sensitivity to zero volts... 2... 2... 2. ee. 296 
Figure 6.60 Dynamic range of the THz-CT system for different 
chopping frequencies measured with a lock-in 
amplifier with 1 ms time constant. ......... 297 
Figure 6.61 Measured and fitted normalized power at the 
detector for a knife translation in y- and z-directions. 
The Gaussian beam waists are 2.54 mm in 
y-direction and 2.40 mm in z-direction... ..... 297 
Figure 6.62 Tomographic reconstruction of a Y-shaped hook 
driver inside a polyethylene container. The image 
was recorded with 1 mm spatial and 9° angular 
resolution within a 250 min acquisition time... . . 297 


xlii 


List of Figures 


Figure 7.1 


Figure 7.2 


Figure 7.3 


Figure 7.4 


Figure 7.5 


Operating speed comparison between SiGeC HBTs, 
InP HBTs, and MOSFETs vs. critical lithography 
dimensions (i.e., emitter width or channel length): 
(a) maximum oscillation frequency and (b) transit 
frequency. The lines represent LSQ fits of the data, 
the red filled squares DOTSEVEN results, and the 
larger crosses the best InP HBT data......... 
Impact of device connections to other circuit 
elements (a) on the transit frequency of a SiGeC 
HBT with 120 nm emitter window width and a 
MOSFET of the 28 nm node (b). The upper lines in 
(b) represent the pad and pad-device connection line 
deembedded data, while the lower lines represent 
the un-deembedded data. .............. 
Comparison of the terminal (or extrinsic) 
transconductance of transistors from a large variety 
of process technologies. Filled symbols represent 
measured data and open symbols represent predicted 
(roadmap) data. For FETs, the legend designations 
correspond to the channel material and structure: 
planar I-V such as InGaAs (II-V bulk); planar 
single- or few atomic layers such as black 
phosphorus or germanane (2D); planar transition 
metal dichalcogenide such as MoS2 (TMD); planar 
or FinFET silicon (Si-CMOS); nano-wire silicon or 
III-V (NW); carbon nano-tube (CNT). ....... 
Potential applications for silicon integrated mm-wave 
and THz circuits. .. 2... .........00. 
Frequency versus output power of some recently 
published SiGe integrated mm-wave/THz 

SOULCES? 4. ex Hoe. See, oso tok aie, ae ot Sn 


310 


311 


Table 1.1 
Table 1.2 
Table 1.3 
Table 2.1 
Table 5.1 
Table 5.2 
Table 5.3 
Table 5.4 
Table 5.5 


Table 6.1 


Table 6.2 
Table 6.3 


Table 7.1 


List of Tables 


Process modules done by Infineon and IHP 

for the bipolar-only runs (left) and for the full 
BiCMOS process (right) .............0.. 
HBT parameters of EBL HBTs fabricated in joint 
Infineon/IHP flows in comparison to results of EBL 
and DPSA reference flows .............. 
HBT parameters of different process splits ..... 
Definition of the stress bias conditions P1, P2, and P3 
and their corresponding junction temperatures... . 
Stress bias conditions and corresponding junction 
temperatures . . 2... 2... eee eee 
Details of the DUTs for RTS noise measurements . . 
Key figures of the analyzed IFX technology states . . 
Bulk thermal conductivities ............. 
Optimized parameters of the scalable RTH 

models sin ee Yoke tee Wd, Als a E ad 
Comparison of LNA related FoMs for different 
technologies and topologies ............. 
Performance summary of the W-band LNAs.... . 
Performance summary of the W-band frequency 
triplers* 2.03.8 ee sists OS GS eed athe and 
Si-integrated wireless communication links 

above 200 GHz ................000. 


xliii 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


ATSF 
BBA 
BC 
BEOL 
BiCMOS 
BJT 
BTB 
BTE 
CB 

CE 
CL-ICPW 
CM 
CMC 
CML 
CMOS 
CMP 
CPW 
DC 
DPSA 
DT 
DUT 
DUV 
EB 
EBL 


List of Abbreviations 


Aging Time Scale Factor 
Broadband Amplifier 
Base-—collector 
Back-end-of-line 

Bipolar Complementary Metal Oxide Semiconductor 
Bipolar Junction Transistor 
Band-to-band 

Boltzmann transport equation 
Common base 

Common emitter 

Capacitively loaded inverted coplanar waveguide 
Compact model 

Compact model coalition 
Current mode logic 
Complementary Metal-Oxid-Semiconductor 
Chemical mechanical polishing 
Coplanar wave guide 

Direct current 

Double-poly-Si self-aligned 
Deep trench 

Device under test 

Deep ultraviolet 

Emitter—base 

Epitaxial base link 
electro-magnetic 
Finite-element method 
Maximum oscillation frequency 
Figure of Merit 

Frames per second 

Transit frequency 

Gigabit per second 


xlv 


xlvi List of Abbreviations 


GCPW 
GICCR 
G-R 
HBT 
HBT 
HC 

HD 

HF 
HICUM 
IFX 
IHP 

II 

IIP3 
IMD 
InP 
ITRS 
LDD 
LNA 
LRM 
LSQ 
MM 
mm-wave 
MOSFET 
NSEG 
P_DC 
PA 
PDK 
PPW 
RF 
RTP 
RTS 
SCR 
SEG 
SEM 
SH 

Si 

SIC 
SiGe 
SOI 


grounded coplanar wave guide 

General integral charge-control relation 
Generation-Recombination 
Heterojunction Bipolar Transistor 
Heterojunction bipolar transistor 

Hot carrier 

Hydrodynamic 

High-frequency 

High Current Model 

Infineon Technologies AG 

Innovations for High Performance microelectronics 
Impact ionization 

Third-order input intercept point 

Inter metal dielectric 

Indium phosphide 

International technology roadmap for semiconductors 
Low-doped drain 

Low-noise amplifier 

Line reflect match 

Least squares 

Mixed-mode 

Millimeter wave 
Metal-Oxid-Semiconductor Field Effect Transistor 
Non-selective epitaxial growth 

Power dissipation 

Power amplifier 

Process design kit 

Parallel plate waveguide 

Radio frequency 

Rapid thermal processing 

Random telegraph signal 

Space charge region 

Selective epitaxial growth 

Scanning electron microscope 
Self-heating 

Silicon 

Selectively implanted collector 

Silicon germanium 

Silicon on insulator 


SOLR 
S-parameters 
SPICE 
SRH 
ST 

STI 
STM 
TCAD 
TE 
TEM 
™ 
TRL 
VCO 
VNA 
W-band 
WP 


List of Abbreviations xlvii 
Short open load reciprocal 

Scattering parameters 

Simulation Program with Integrated Circuit Emphasis 
Shockley-Read-Hall 

Shallow trench 

Shallow trench isolation 

STMicroelectronics 

Technology computer-aided design 

Transverse electric 

Transmission electron microscopy 

Transverse magnetic 

Thru reflect line 

Voltage controlled oscillator 

Vector network analyzer 

90 to 110 GHz frequency range 

Work package 


Taylor & Francis 
Taylor & Francis Group 


http://taylorandfrancis.com 


Introduction 


M. Schréter! 


‘Chair for Electron Devices and Integrated Circuits, Technische Universitat 
Dresden, Germany 

Department of Electrical and Computer Engineering, University of 
California at San Diego, USA 


The semiconductor industry is the fundamental building block of the new 
economy. There is no area of modern life untouched by the progress of micro- 
and nanoelectronics. The electronic chip is becoming an ever-increasing 
portion of system solutions, starting initially from less than 5% in the 1970 
microcomputer era, to more than 60% of the final cost of a mobile telephone, 
50% of the price of a personal computer (representing nearly 100% of the 
functionalities), and 30% of the price of a monitor. In addition to their value 
in terms of cost, semiconductor components are the enablers of new applica- 
tions, such as ABS, location detection (e.g., GPS positioning), smart cards, 
autonomous vehicles, and high-rate data communications (e.g., machine to 
machine), where electronics take over a lot of the essential functionalities 
and offer safety and security. Thanks to advances in nanoelectronics with 
their more than proportional increase of operational capabilities in relation 
to cost, new market opportunities are emerging worldwide, characterized by 
the needs of products and services offering, e.g., mobility, connectivity, and 
security. 

Interest in utilizing the (sub-)mm-wave frequency spectrum for commer- 
cial and research applications has been steadily increasing. Such applications, 
which constitute a diverse but sizeable future market, span a large variety of 
areas such as health, material science, mass transit, industrial automation, 
communications, and space exploration. For the deployment of the respective 
high-performance and partially highly integrated circuits and systems in 
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Figure 1 List of DOTSEVEN project partners and their home countries. 


commercial markets, silicon-germanium (SiGe) heterojunction bipolar tran- 
sistor (HBT) BiCMOS technology is well-suited due to the combination of 
HBT device speed for the high-frequency front-ends with the high integration 
levels of CMOS for digital signal processing [Che17, Sch17]. 

Based on a successful research cooperation in the predecessor project 
DOTFIVE, which produced the first half-THz SiGe HBTs [Hei10], the 
project DOTSEVEN was launched in late 2012 by the European Commission, 
targeting a variety of ambitious goals such as pushing HBT performance 
to 700 GHz (maximum oscillation frequency) and demonstrating working 
systems at 240 GHz. More and specific goals will be detailed later. For 
accomplishing the project’s goals, a consortium consisting of partners with 
complementary expertise was assembled from industry, a research institute, 
and academia (cf. Figure 1). The project was subdivided into four technical 
work packages (WPs), one dissemination WP, and one administrative WP. 

This book provides an overview of the research results of DOTSEVEN. 
It starts in this chapter with the motivation at the beginning of the project 
and a summary of its major achievements. The subsequent chapters pro- 
vide a detailed description of the obtained research results in the various 
areas of process development, device simulation, compact device modeling, 
experimental characterization, reliability, (sub-)mm-wave circuit design and 
systems. 
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Motivation and Objectives of the DOTSEVEN Project 


The DOTSEVEN project proposal was motivated by the increasing inter- 
est in utilizing the mm-wave frequency spectrum within the so-called THz 
gap!, which ranges from 0.3 to 30 THz (cf. Figure 2), for a wide variety 
of applications. Examples for these applications at the beginning of the 
project were >120 GHz industrial sensors including mm-wave scanning and 
radar, extremely broadband ADCs with 50-100 Gs/s and >25 GHz signal 
bandwidth at 5—6 bit resolution, 400 Gb/s optical (backbone) transmission, 
as well as highly linear amplifiers, e.g., for 5G mobile communications. 
These circuits and systems serve a large variety of markets [Sie02, Sie04, 
Ton07, Kuk10, Cool1, Tay11, Sonl1, Kem11, Ajil1, Eis11] such as health 
care and biology (e.g., medical equipment, patient monitoring, tissue and 
genetic screening), infrastructure and construction (e.g., structural safety), 
mass transportation (e.g., security screening, automotive radar, in-seat enter- 
tainment), industrial automation (e.g., sensors), and communications (e.g., 
high-bandwidth terrestrial point-to-point wireless, satellites). The deploy- 
ment of the associated high-performance circuits and systems in commercial 
and military markets is driven mainly by cost, form-factor, and energy- 
efficiency. 

The rapidly increasing interest in THz-applications was documented by 
the start of a new IEEE Journal, namely the “Transactions on Terahertz 
Science and Technology” as well as first business reports (e.g., [Thil1]), 
according to which applications operating in the mm- and sub-mm-wave 
range constitute a diverse but quite sizeable future market. 

The design and implementation of high-speed circuits such as those men- 
tioned above requires individual transistors to be able to operate, i.e., maintain 
power gain, at 3-10 times higher frequencies. This puts their characteristic 
operating frequencies well beyond the previous 500 GHz. Therefore, the main 
objectives of the DOTSEVEN project were: 


e The realization of SiGe:C HBTs operating at a maximum (oscillation) 
frequency up to 700 GHz (i.e., 0.7 THz) at room temperature. 

e The evaluation, understanding, and modeling of the relevant physical 
effects occurring in such high-speed devices and circuits for supporting 
process development and circuit design. 


'The designation “gap” results from the strong drop in signal output power generated from 
both electronic and optical devices in this frequency range. 
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Figure 2 Location of the THz frequency range within the electromagnetic spectrum. The 
overlap region (THz-gap) between electronic and photonic approaches around 1 THz is 
indicated. 


e The design and demonstration of working integrated mm- and sub-mm- 
wave circuits using such HBTs for specific applications as specified 
further below. 

e Establishing and maintaining the European leadership in sub-mm-wave 
SiGe HBT process technology and opening up the mm- and sub-mm- 
wave market to the broader European and international industry. 


In this book, the designations “maximum frequency” or “operating speed” 
of a transistor are synonymous for maximum oscillation frequency fmax 
and CML gate delay TcemL, which — compared to the often used common- 
emitter current gain transit frequency fr — are more relevant transistor-related 
figures of merit (FoMs) for circuit applications and provide more detailed 
performance information.” Nevertheless, the project strived for a balanced 
device design with a reasonable ratio fmax/f7 not exceeding two in order 
to make the developed process technologies applicable to a wide range of 
applications. 


Note that fr and fmax Values are always extrapolated from a single-pole low-pass behavior 
of the respective gain. This definition enables a reliable comparison of device-related speed 
performance between different technologies. 
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Approach toward Achieving the Ambitious Goals 


The research and development aspects of the DOTSEVEN project were 
tackled by four clearly defined and well-connected WPs during a period of 
45 months. Below is a brief description of each WP. Details are discussed in 
the subsequent technical chapters. 

Work package 1 comprised advanced and revolutionary process devel- 
opment with the goal to create a “SiGe HBT process technology platform,” 
subdivided into two major directions. First, at the research institute novel 
ideas were pursued and possibly revolutionary process modules were devel- 
oped with a focus on device architectures that had a high probability of 
significantly improving the device performance. Key aspects addressed here 
were in the areas of collector formation, vertical profile optimization, and 
emitter/base architecture. Second, at the industrial fabrication partner the 
most promising research results were further considered regarding their 
potential integration in an advanced industrial bipolar and later on also in 
a BiCMOS production process flow. In parallel to the above activities, the 
additional passive and active devices necessary for fully integrated mm-wave 
circuits were developed and integrated into the existing process flow. At 
certain stages of the project, wafers and process design kits (PDKs) were 
delivered to the other partners in (i) WP3 for elaborate electrical charac- 
terization, model parameter extraction, and compact model development, 
(ii) WP2 for the calibration and verification of numerical simulation tools, 
and (iii) WP4 for circuit design and characterization as well as building the 
demonstrators. 

Work package 2 was dedicated to “Computational modeling tools” using 
physics-based predictive simulation tools for solving fundamental equations 
numerically, which allow the simulation of electrical characteristics of the 
fabricated structures. The device simulation subset, solving the semicon- 
ductor equations, is also known as “Technology Computer Aided Design 
(TCAD)”. This modeling activity aimed at the continuous support of not only 
the technology development work in WP1 by providing guidelines for opti- 
mizing device performance, but also the development of compact models and 
the generation of virtual electrical characteristics for extracting preliminary 
model parameters for WP4. Advanced hydro-dynamic (HD) transport models 
were employed as a compromise between acceptable computation time and 
accuracy of simulations, while the Boltzmann transport equation (BTE) was 
utilized for predicting the electrical performance and calibrating the HD 
transport model. In addition, three-dimensional thermal and electromagnetic 
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simulations, partially based on in-house tools developed previously, were 
applied for predicting and investigating distributed parasitic effects resulting 
from self-heating and HF operation. 

Work package 3 emphasized on “Device characterization and compact 
modeling” and linked process technology (WP1) with circuit design (WP4). 
This WP addressed the following tasks. First, a common set of optimized 
test structures and new model parameter extraction strategies were devel- 
oped. The second task was dedicated to compact model development for 
overcoming the deficiencies of existing models for sub-mm-wave frequencies 
and applications; also, the large-signal compact model was demonstrated to 
be valid all the way up to 1 THz [Sch14]. Within the third task, based on 
existing hardware and in collaboration with WP1 and WP2, realistic compact 
models representing the target process (i.e., possible target profiles for a 
0.7 THz HBT) and its performance were predicted in order to establish a 
rough guideline for process development. The fourth task was to set up the 
methodology for accurate transistor measurements as well as suitable de- 
embedding and calibration methods at the targeted mm- and sub-mm-wave 
frequencies. The fifth task, again in strong cooperation with WP2 and WP1, 
was to do preliminary investigations of the reliability of the newly developed 
HBTs. 

Work package 4 comprised “Millimeter-wave circuit applications and 
demonstrators”. The objective of this work package was twofold. First, sim- 
ple benchmark circuits were designed in cooperation with WP3 to evaluate 
the developed models under realistic circuit conditions, to allow a comparison 
with other process technologies, and to give circuit designers a good idea 
of the process capabilities for practical applications. The simple nature of 
these circuits provided valuable feedback regarding the impact of specific 
physical effects and the accuracy of the delivered models. In parallel, auto- 
mated procedures for the initial design of these mm-wave benchmark circuits 
were developed to enable their implementation by modeling and process 
engineers. Second, as the major goal, WP4 demonstrated the viability of 
next-generation mm-wave and THz applications by exploiting the devel- 
oped advanced process technology and modeling capabilities. In particular, 
WP4 aimed at building the demonstrators described later in more detail and 
hence indispensable design expertise to spur economic growth in emerging 
mm-wave and THz markets, such as communications, safety, health care 
environment, and security, by taking into account industrial design objectives 
and specifications. 
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In addition to the four technical and scientific work packages, the dissem- 
ination and presentation of the project results were organized by the separate 
WPS. Besides journal and conference publications, project results were pre- 
sented in tutorials and workshops at various conferences. That material can 
be found at www.iee.et.tu-dresden.de/iee/eb/res/dot7/dot7.html. 


Overview of Results and Their Impact 


DOTSEVEN turned out to be highly successful as it even exceeded some of 
its goals at an extremely competitive cost. Significant improvements against 
the previous state-of-the-art were accomplished in all areas of SiGe HBT 
research, which has been documented by the following (non-exhaustive) list 
of technical achievements: 


e Demonstration of SiGe:C HBTs with new room temperature world 
record performance of 720 GHz fmax and 1.34 ps minimum gate delay 
as well as the best ever balanced combination with an fr of 505 GHz 
and a BVcRo of 1.86 V, all based on a 130 nm lithography. 

e Proof of concept of an industrial 130 nm SiGe BiCMOS process 
with (fmax, fr) = (500, 300) GHz and pre-development of a cost- 
efficient industrial 130 nm SiGe BiCMOS process with leading edge 
performance of (fmax, fT) = (360, 250) GHz. 

e Establishment of a full suite of technology computer-aided design 
(TCAD) tools for accurately simulating and modeling advanced SiGe:C 
HBTs as well as for predicting their future performance along with the 
first SiGe HBT technology roadmap for the ITRS/IRDS. 

e Delivery of accurate HICUM/L2 SiGe HBT models for first-time-right 
mm-wave designs. 

e Demonstration of several world record and benchmark circuits at fre- 
quencies up to 500 GHz (triple push oscillator) and fundamental circuits 
up to 240 GHz (fundamental oscillators and amplifiers), including the 
realization of mm-wave power amplifiers above 200 GHz with out- 
put power up to 10 dBm and the successful demonstration of power 
combining techniques in SiGe-based PAs above 200 GHz for the first 
time. 

e Demonstration of the first all-silicon computer tomograph operating at 
490 GHz as well as of a 210-270 GHz 3D imaging system. 

e Demonstration of 240 GHz transmitter and receiver for communications. 
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e Increase of output power and reduction of power dissipation by about a 
factor two for an industry-based 77 GHz automotive radar transmitter. 


Emerging mm-wave and THz markets will benefit from the developed 
0.7 THz SiGe HBT transistors in two ways: (1) Their superior HF perfor- 
mance and possible combination with CMOS integration capabilities make 
them an enabling technology for applications that historically have exhibited 
low integration levels and yield. (2) Economies of scale can be used to provide 
a cost-effective platform for highly integrated subsystems and single-chip 
solutions at mm-wave frequencies and beyond. As such, a 0.7 THz SiGe 
HBT technology targets the system miniaturization at very high frequencies 
to enable the ““mm-wave System-On-Chip” in the future. 

In summary, SiGe:C HBTs have proved their capability to support 
large bandwidth and high data rates for high-speed/high-frequency systems. 
Devices with impressive operating frequency now have been demonstrated 
that only a couple of years ago would have been believed to be reserved for 
II-V technologies. The higher operating speed of SiGe:C HBTs developed 
within the proposed project DOTSEVEN can be leveraged for advanced 
circuits and systems in different ways: 


e They can open up new applications at very high frequencies (THz) using 
harmonics while still providing higher output power than passives used 
today. 

e Their speed can be traded for lower power dissipation. 

e They can be used to mitigate the impact of process, voltage, and temper- 
ature variations (PVT-variations) at lower frequencies for higher yield 
and improved reliability (e.g., in case of automotive radar application or 
high-bandwidth communications requiring highly linear front-ends). 

e The resulting BiCMOS technologies enable the fabrication of complex 
(sub-)mm-wave systems for medium- and high-volume applications. 


The results listed above were obtained through a tight cooperation and 
efficient communication between technology, modeling, and design part- 
ners within and across work packages. For instance, the close cooperation 
between the two technology providers made a fast transfer of IHP’s research 
results into IFAG’s production technology pre-development cycle possible. 
DOTSEVEN has cemented the international leadership of the European mm- 
wave community. The results mentioned above were achieved with a total EU 
contribution of just 8.6 M€ and an overall cost of just 12.3 M€, which is 
extremely small compared to the development of III-V technologies and in 
particular RF-CMOS, especially when considering the fact that as of today 
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still no RF-CMOS process exists with HF performance that is even close to 
the one of DOTSEVEN. A more detailed technology comparison is given in 
the Outlook chapter. 

In summary, DOTSEVEN has played a unique key role in the develop- 
ment of a new generation of high-speed silicon technologies and has pushed 
Europe to the forefront of the world-wide competition. The project benefited 
significantly from the strong links between research and industry. It enabled 
to pursue the broad range of activities from device and material research to the 
system implementation and demonstration simultaneously and with sufficient 
critical mass, thus opening up the path to a rapid exploitation of the results 
and economic growth on the one hand and providing the means for educating 
a highly skilled work force in the mm- and sub-mm-wave application area, 
on the other hand. Overall, DOTSEVEN demonstrated clearly that seriously 
advancing high-frequency electronics is possible with relatively low cost 
(compared to CMOS and II-V technology) by assembling a suitable multi- 
disciplinary consortium, efficient organization, and defining meaningful and 
realistic goals. 
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1.1 Introduction 


Advances in silicon—germanium (SiGe) heterojunction bipolar transistor 
(HBT) technologies resulted in an impressive increase in high-frequency 
performance during the last decade extending the addressed application 
frequencies into the mm- and sub-mm-wave bands. Today, SiGe HBTs are 
widely used for applications like automotive radar, high-speed wireless and 
optical data links, and high-precision analog circuits. BiCMOS technologies 
which comprise high-speed SiGe HBTs in a radio-frequency (RF) CMOS 
technology environment combine the excellent RF performance of SiGe 
HBTs with the high level of integration and the high computing power of Si 
CMOS. These technologies became a key enabler for demanding mm-wave 
systems which integrate radio front-end circuits together with digital control 
circuits and signal processing on a single chip. Previous development has 
demonstrated that SiGe HBTs continue to offer significantly higher cutoff fre- 
quencies, higher output power, and superior analog characteristics compared 
to CMOS transistors of the same lithography node. Thus, the integration of 
SiGe HBTs in a CMOS platform represents a very attractive option to boost 
the RF performance of a given technology node. 

The state of the art of SiGe HBT technology before the start of the 
DOTSEVEN project in October 2012 was reviewed in [Che11]. Develop- 
ments performed within the predecessor project DOTFIVE resulted in the 
first demonstration of SiGe HBTs with maximum oscillation frequencies, 
fmax. of 500 GHz together with transit frequencies, fp, of 300 GHz and 
minimum ring oscillator gate delays of 2.0 ps [Heil0]. This was the starting 
point of the DOTSEVEN project addressing the challenging target for SiGe 
HBTs with peak fmax values of 700 GHz and minimum gate delays of 


11 


12 SiGe HBT Technology 


1.4 ps. Figure 1.1 summarizes published peak fr and fmax values of selected 
high-speed SiGe HBT processes from the last decade. BiCMOS technologies 
with peak fmax values between 300 GHz and 400 GHz and peak f'r values 
of 230-320 GHz are in production or pre-production at several companies 
now. Recent research results have demonstrated that this performance can 
be increased much further. The values obtained within the DOTSEVEN 
project are indicated as red diamonds in Figure 1.1. In 2015, two separate 
investigations demonstrated new record values for fmax [Boe15] and fr 
[Kor15]. Further technology optimization finally enabled the demonstration 
of the DOTSEVEN goal including the simultaneous realization of peak fr 
and fmax values of 505 GHz and 720 GHz, respectively [Heil6]. 

The reminder of this chapter is organized as follows. Major performance 
factors of SiGe HBTs are reviewed in the section “HBT Performance Fac- 
tors.” Fundamental dependencies of typical high-frequency figures of merit 
(FoMs) on device parameters are discussed here. The section “HBT Device 
and Process Architectures Explored in the DOTSEVEN Project” addresses 
device architectures for high-performance SiGe HBTs and process inte- 
gration aspects. Favored process options for HBTs with selective epitaxial 
growth (SEG) and with non-selective epitaxial growth (NSEG) of the SiGe 
base layer are analyzed in detail. The focus is on the work performed in the 
DOTSEVEN project concerning the development of the high-performance 
SiGe BiCMOS technology platform B11HFC at Infineon (see the section 
“DPSA-SEG Device Architecture”), the investigation of an advanced HBT 
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Figure 1.1 Peak fr and fmax values of high-speed SiGe HBT technologies. Red diamonds 
indicate results of the DOTSEVEN project. 
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process with SEG of the base and epitaxial base link (EBL) regions (see the 
section “Approaches to Overcome Limitations of the DPSA-SEG Architec- 
ture”), and the optimization of a process with NSEG of the base (see the 
section “Non-selective Epitaxial Growth of the Base”), which was finally 
utilized by IHP to reach the DOTSEVEN goal. The section “Optimization of 
the Vertical Doping Profile” addresses the optimization of the vertical doping 
profile for f'r improvement. The final technology optimization for minimum 
device parasitics and balanced fp and fmax improvement is discussed in the 
section “Optimization towards 700 GHz fmax.” 


1.2 HBT Performance Factors 


Typical FoMs characterizing a process technology in terms of high-frequency 
performance are the transit frequency fr and the maximum oscillation fre- 
quency fmax. The transit frequency fr is defined as the frequency for which 
the small-signal current gain |/;|falls to unity, i.e., 


[ho (Plr =1. (1.1) 


The frequency fmax is defined as the maximum frequency for which the 
transistor can amplify power. In this context, Mason’s unilateral power gain 
U is widely used, and fmax is defined by: 


AETR =1. (1.2) 


While the frequency fmax represents a speed metric for circuits such as 
amplifiers and oscillators, f'y gives a measure of the speed of switching 
circuits such as dividers. Ring-oscillator gate delays are relevant FoMs for 
digital high-speed circuits. Here, we use the current mode logic (CML) 
ring-oscillator gate delay time tc. In addition, the base—collector break- 
down voltage BV cgs and the open-base emitter—collector breakdown voltage 
BV ceo are important since they determine the maximum output power that 
can be provided by a transistor. Further characteristics of relevance for eval- 
uating potential applications include the minimum noise figure, the linearity, 
and the gain of a transistor. 

In the following, we are going to discuss the impact of different 
device regions and their electronic properties on RF performance. Basic 
device regions are indicated in the generic cross section of a high- 
performance SiGe HBT shown in Figure 1.2. For the analysis of the con- 
tribution of the individual device region to the delay time of the transistors 
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Figure 1.2 Schematic cross section of a high speed SiGe HBT. 


response to an RF signal, it is helpful to relate the resistances and capacitances 
of the device regions to a simplified small-signal compact model. 

Figure 1.3 indicates the resistances and capacitances of the individual 
device regions and relates them to a small signal equivalent circuit for 
the transistor operation in forward active mode. The model includes the 
resistances Rp, Rp, and Rc, of the emitter, base, and collector, respectively. 
The base resistance is divided into an intrinsic contribution Rp; and extrinsic 


Extrinsic 
EB region 


Extrinsic 
BC region 


Figure 1.3 Device cross section with parasitic resistances and capacitances associated with 
different device regions (a) and a corresponding small signal equivalent circuit (b). 
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contribution Rpx originating from the link region which contacts the intrinsic 
base. The base—collector capacitance (Cgc) is divided into an intrinsic part 
Cgc; and an extrinsic part Cpcx related to the base link region. Cpp includes 
the depletion capacitance as well as the oxide capacitance of the base-emitter 
junction, C gig is the diffusion capacitance related to the storage of minority 
charges in the forward operation mode, p is the forward DC current gain, 
8m is the transconductance, Rr, is the output resistance related to the Early 
effect, and V’pp is the intrinsic base-emitter voltage. 

The frequency-dependent small signal current gain h2\(f) of the model 
depicted in Figure 1.3 is approximately given by: 


: z + j2rf ( 
—— Z — 3J T 
hai (f) t 

In the limit of large frequencies, h21 is inversely proportional to the frequency 
f. The corresponding unit gain transit frequency f'r is given as: 


Ca + Cpr + CBC is 
Im 


(Re + Ro) Cnc) C133) 


1 Cait + Caz + Cac 
Qn fT gm 


+ (Re + Rc) Cgc. (1.4) 


The transconductance gm is proportional to the collector current Ic in the 
bias region of ideal exponential slope according to gm = qIc/kgT, where q 
is the elementary charge, kp is Boltzmann’s constant, and T is the junction 
temperature. The diffusion capacitance Cq; accounts for the storage of 
locally compensated minority carriers during forward transistor operation. 
The contribution to Cgjg can be analyzed in a charge-control model [Tau98]. 
This analysis relies on the fact that any variation of the bias point of the device 
is related to changes of the carrier densities within the device which are fed 
by currents into the device contacts. The corresponding forward transit time 


t Caig 
Im 


TF = TE + TEB + TB +TBC (1.5) 


can be divided into contributions accounting for charge storage in the emitter, 
base—emitter junction, base, and base—collector junction regions, respectively. 
According to the charge-control model, these contributions are approximately 
given by: 


ESS; (1.6) 


TEB = —; (1.7) 
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TB = + (1.8) 
2DnB Usat 
WBC 
Taz . 1.9 
BC = a (1.9) 


Here, wg is the width of the neutral base region, wgc is the depletion width 
of the base—collector junction, Dyp is the electron diffusion coefficient in the 
base, and Vsat is the saturation velocity of electrons. Cz and Cy denote 
the parts of the diffusion capacitance related to neutral charge storage in 
the emitter and base-emitter junction regions, respectively. The compensated 
charge Cy stored in the base-emitter junction can account for a significant 
contribution to tp in particular at high current densities [Hue96]. The emitter 
delay time tp is of minor importance for typical SiGe HBTs since the amount 
of holes stored in the emitter is inversely proportional to the current gain. 
The magnitude of Cp is determined by the emitter properties. The maximum 
oscillation frequency of the equivalent circuit of Figure 1.3 is approximately 
given by: 


fr 
= : 1.10 
fmax J= ((Rpx + Rpi) Cpi + RgxCBx) Eo 


This relation is reduced to: 


a AT 
fmax S 8nRnOnc’ (1.11) 


if Rp and Cgc are not separated into extrinsic and intrinsic contributions. 
Based on the Equations (1.4) to (1.10), the following scenario can be 
envisioned for the enhancement of the cutoff frequencies fy and fmax by 
scaling vertical and lateral device dimensions. The transit frequency fr is 
predominantly determined by the vertical doping profile. Figure 1.4 illustrates 
qualitatively the directions of profile optimization for f'r enhancement. 
Reduction of the width wp of the boron-doped base reduces the base 
transit time tp according to Equation (1.7). A minimum width of the boron- 
doped region has to be ensured together with a low base sheet resistance. 
Today, base layers with typical sheet resistances of about 2 kQ/sq can be 
grown epitaxially with widths of less than 5 nm. In addition to the deposition 
of a thin base, its diffusion during subsequent processes has to be kept as 
small as possible. A widely applied approach to minimize B diffusion is 
the additional doping of the SiGe layer with carbon [Lan96, Ost97, Rue99]. 
Moreover, the thermal budget of post-epi processing has to be kept low. 
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Figure 1.4 Schematic vertical doping profile of a SiGe HBT. The dashed lines indicate a 
scaled profile for enhanced fr. 


The challenge here is to realize simultaneously high dopant activation in 
heavily doped device regions and minimum diffusion broadening of the base. 

Together with the base width, the width of the Ge profile is also shrunk. 
This allows one to increase the peak Ge concentration without exceeding 
the critical thickness of the SiGe layer above which the SiGe layer becomes 
thermodynamically unstable against the formation of dislocations. For low 
base transit time tp, it is beneficial to realize a steep gradient of the Ge 
concentration across the non-depleted base width wg. The grading of the Ge 
profile as indicated in Figure 1.4 causes a built-in electric field due to the 
decrease of the band gap with increasing Ge content. This field accelerates 
minority electrons in the base and reduces tp. 

Reduction of depletion width wgc of the base—collector junction is a 
measure to reduce the base—collector transit time tgc. This reduction of tge 
has to be traded off against an increased base—collector capacitance Cgc; and 
a reduced base—collector breakdown voltage due to reduced wgc. 

The neutral charge storage Cy in the base-emitter region can be reduced 
by decreasing of the base-emitter depletion width wgpg in conjunction with an 
optimized Ge-profile in the base-emitter junction region. However, reduction 
of wpp results also in a reduction of the base-emitter breakdown voltage 
BV £go and in tunnel currents at low base-emitter voltages. These effects 
have to be traded off against the reduction of Cy. 

The doping profiles of the non-depleted emitter and collector regions 
are optimized for low resistivity due to high concentrations of electrically 
active dopants. The lowest emitter resistances are typically achieved with 
mono-crystalline emitters. In addition to the above mentioned measures for 
fr enhancement by vertical profile engineering, one also has to minimize 
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contributions to the base-emitter and base-collector capacitances originating 
from the edges of the device for realizing higher fr values according to 
Equation (1.4). 

For realizing high fmax values, it is crucial to minimize the base resis- 
tance and the base-collector capacitance together with high fr values as 
indicated by Equation (1.9). The required reduction of these device parasitics 
is typically addressed by scaling lateral device dimensions and by optimizing 
the base-link regions. Figure 1.5 illustrates relevant lateral device dimensions 
and major contributions of the base-link region to Rp and Cgc. 

Contributions to the extrinsic base resistance originate from the extension 
of the base layer below the base-emitter spacer, from the adjacent mono- 
crystalline or poly-crystalline p-doped region, the contact resistance between 
silicide and base poly-Si, the silicide resistance, the contact resistance 
between silicide and metal contact plug, as well as from the resistance of 
subsequent metal regions. The extrinsic base—collector capacitance includes 
capacitances of the mono- and poly-crystalline extrinsic base regions to 
the selectively implanted collector (SIC), the buried collector layer in the 
active region, and the buried collector below the base—collector isolation 
layer. Reduction of these parasitic resistances and capacitances is addressed 
by reducing the corresponding lateral device dimensions such as the width 
of the base-emitter spacer dsp, the width of the emitter poly-Si wep, and 
the width of the active collector region wco). However, depending on the 
details of the device architecture, there are several tradeoffs between the 
different parameters. For example, the reduction of wc) can lead not only 
to a reduction of Cgc, but also to an increase of Rpx. 


Figure 1.5 Device cross section with lateral device dimensions and major contributions of 
the base-link region to Rg and Cgc. 
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The intrinsic contribution to the base resistance Rp; can be reduced for a 
given base sheet resistance Rsp; by narrowing the emitter window width wp. 
In a typical scaling scenario, lateral scaling of wg is accompanied by vertical 
scaling of the doping profile and increased current densities at peak fr. Under 
these conditions, it is a major challenge to maintain low emitter and collector 
resistances as well as thermal resistances when the emitter width is scaled 
down. 

Scaling of the HBT device dimensions under the boundary conditions of 
minimum base, emitter, and collector resistances imposes complex require- 
ments on device architecture and fabrication process. These challenges 
have stimulated various innovations of the HBT fabrication process which 
addressed the reduction of the individual device parasitics by structural 
improvements as well as by improved material properties such as reduced 
specific and contact resistances. Approaches explored in the DOTSEVEN 
project will be reviewed in the following sections. 


1.3 HBT Device and Process Architectures Explored 
in the DOTSEVEN Project 


Innovations of the device architecture and of the fabrication processes have 
been major factors for the improvement of the RF performance of SiGe HBTs 
during the last decades. Fundamental requirements on the device architec- 
ture for high-speed HBTs are minimum access resistances to the intrinsic 
emitter, base, and collector regions together with low contributions of the 
extrinsic device regions to the base—collector and base-emitter capacitances. 
The development of device and process architectures which facilitate the 
simultaneous realization of low Rp and low Cgc has been a major challenge 
in this context. The realization of devices with low thermal resistances is a 
further requirement in order to limit self-heating. 

The above-mentioned device targets have to be realized in fabrication 
processes which are manufacturable in high volumes with high yield. A 
further fundamental requirement on the HBT fabrication process is the 
compatibility with the addressed CMOS technology platform. The integra- 
tion of SiGe HBTs and other RF-enabling passive or active devices into a 
BiCMOS technology platform has to be realized without degrading HBT 
or CMOS device characteristics or yield. The large potential of advanced 
CMOS processes for geometry scaling opens new options also for the HBT 
fabrication. However, new challenges arise for the integration of SiGe HBTs 
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in continuously shrinking CMOS nodes from tight constraints on the thermal 
budget and on device topology. 

As regards the SiGe HBT device concepts, all current production-related 
high-speed transistors take advantage of the so-called double-poly-Si (DP) 
architecture. This configuration provides access from the contact region to 
the intrinsic base and emitter region by poly-Si layers which are dielectrically 
isolated against the surrounding transistor regions. It is a powerful means 
to keep extrinsic parasitics, such as Rpx, Cpcx, Rg, and Cpr, small. It is 
therefore evident that the basic structure of modern SiGe HBTs is becoming 
more similar. Nevertheless, we are faced with quite different approaches for 
device manufacturing resulting in different consequences of their potential 
electrical performance. 

A key differentiator for SiGe HBT fabrication is the way in which the 
SiGe base is formed. Existing SiGe HBT technologies use either selective or 
non-selective epitaxial growth of the base. Both approaches have been used 
for the development of high-performance SiGe HBT processes and found 
their way into industrial mass production. In the DOTFIVE project, tech- 
nological solutions were developed promising further speed enhancements 
for HBT concepts with selective as well as with non-selective base epitaxy. 
Due to their specific implications on the process complexity and the self- 
alignment of the transistor regions, various technologies were investigated 
also in the DOTSEVEN project using the different base-epitaxy methods. 
Opportunities and challenges of the two approaches will be discussed in detail 
in the following subsections. This applies also to process options regarding 
the lateral collector isolation by deep trenches or by the standard shallow 
trenches of the CMOS process, the formation of the highly conductive sub- 
collector, and the formation of the base-emitter structure. The choice of the 
substrate, i.e., bulk or silicon-on-insulator (SOD), is another criterion to dif- 
ferentiate SiGe HBT technologies. Driven by the continuous development of 
SOI-based CMOS technologies several publications have been devoted to the 
issue of a suitable technology and device concept for high-speed SiGe HBTs 
on SOI wafers [Was00, Rue04, Ave05, Thil3]. Here, we will not address 
this architectural aspect because it was outside the focus of the DOTSEVEN 
project. 


1.3.1 Selective Epitaxial Growth of the Base 


The classical DP-self-aligned (SA) SiGe HBT technology with SEG of the 
base represents the most attractive process architecture from the point of 
view of manufacturing and degree of self-alignment. As described in the next 
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section, the DOTFIVE project partners Infineon and STMicroelectronics as 
well as FreeScale (now NXP) worked intensively on this concept in the last 
decade to push its performance. However, substantial improvements beyond 
the current level will hardly be feasible with this approach as discussed in the 
section “Approaches to Overcome Limitations of the DPSA-SEG Architec- 
ture.” In the DOTFIVE project, alternative SEG process flows were developed 
to overcome limitations of the conventional DPSA-SEG technology. It will 
be reported below in detail on joint activities of Infineon and IHP in the 
DOTSEVEN project to test one of these approaches within Infineon’s 130 nm 
BiCMOS platform. 


1.3.1.1 DPSA-SEG device architecture 

The conventional double-poly-Si self-aligned SiGe HBT technology with 
SEG enjoys large popularity and has been applied in production for long time 
by several companies [Boe04, Ave09, Joh07]. This process takes advantage 
from the fact that only one lithographic step is needed to completely construct 
the internal transistor. In principle, no further mask step is necessary to form 
the SIC region or the isolation between emitter and base. Usually, this process 
starts with the deposition of a layer stack comprising a bottom oxide, a pt 
poly-Si layer, an upper oxide, and a capping nitride. The emitter window is 
opened by dry etching which stops at the bottom oxide (Figure 1.6(a)). Nitride 
spacer formation will prevent from pulling back the upper oxide during the 
subsequent oxide wet etching. By this step, the intrinsic collector region is 
exposed and an overhang of the p* poly-Si is created (Figure 1.6(b)). At this 
point the SIC can be formed which provides a low-ohmic connection to the 
highly doped sub-collector. In addition, the nitride inside spacers protect 
against Si seeding of the pt poly-Si during the following base epitaxy. 
With this step, the link between the intrinsic base and the extrinsic part 
(pt poly-Si = base poly) is formed (Figure 1.6(c)). Whether the nitride layers 
are removed at a later stage or not is handled differently [Che1 1]. 

The remaining steps are very common also for other HBT processes such 
as technologies with non-selective base epi. Inside base-emitter spacers are 
formed and the in situ doped emitter layer is grown with a non-selective 
epitaxial step. Therefore, at least partly, a mono-crystalline emitter region 
can be found adjacent to the substrate surface already after deposition. The 
HBT flow is continued by patterning the emitter and base poly-Si layers. 
Finally, a short-term high-temperature treatment is needed in order to out- 
diffuse dopants from the highly doped emitter layer into the base cap layer 
before the process flow is completed by salicidation and backend fabrication 
(Figure 1.6(d)). The final annealing step contributes not only to improved 
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Figure 1.6 Schematic cross sections of conventional DPSA-SEG process flow. 


base-current idealities but it also determines the emitter resistance. In the case 
of a bipolar-only technology, e.g., Infineon’s B7HF200 [Boe04], the thermal 
budget is largely governed by the needs of the HBT. In BICMOS processes, 
as a rule, the minimum requirements of the source—drain anneal determine the 
crucial thermal budget of the HBT assuming that a ‘source/drain-after-HBT’ 
integration scheme is realized. 

The basic structure of the aforementioned DPSA-SEG process flow was 
already employed for the first demonstrations [Sat92, Mei95, Pru95]. Essen- 
tially, it has been maintained to date. In Figure 1.7, a TEM cross section of 
Infineon’s latest SiGe HBT transistor generation is shown. 

In [Che11], certain differences between the developments of different 
companies are pointed out. For example, Infineon utilized temporary auxil- 
lary spacers to adjust the area of the SIC whereas the SIC was performed 
right after the emitter window opening at STMicroelectronics. There are 
also deviations in the annealing regime of the base poly. In the Infineon 
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Figure 1.7 TEM cross section of a DPSA-SEG HBT of Infineon’s B11HFC technology 
[Boe15]. 


flow, an extra thermal treatment after base epitaxy is introduced for out- 
diffusing B from the base poly toward the intrinsic base layer. Consequently, 
the base resistance can be reduced but the base tends to broaden causing 
lower fr values. Nevertheless, variations of this annealing showed room for 
optimizations to increase fmax With tolerable constrains for f'y [Can12]. 

The collector construction used by Hitachi [Has14], Infineon [Boe15], 
and STMicroelectronics [Che14] includes all elements which are typical for 
a high-speed Si bipolar transistor: an epitaxially buried, highly doped sub- 
collector isolated laterally by deep trenches and a low-ohmic connection 
to the collector contact realized by a so-called collector sinker. In order to 
save fabrication efforts and to reduce complexity of the BiCMOS process, 
NXP (former FreeScale) developed a Sub-Isolation Buried Layer (SIBL) 
collector structure using only shallow trench isolation (STI) [Joh07]. A 
common feature of all these technologies is the shallow-trench isolation (STI) 
between the internal transistor region and the collector contact which enables 
simultaneously a low capacitive base link. 

In the DOTFIVE project comprehensive efforts were made by Infineon 
and STMicroelectronics to improve the high-frequency behavior of the con- 
ventional DPSA-SEG technology as reported in [Che11]. Clear progress was 
achieved by changing the vertical profile, thermal treatments, as well as the 
lateral transistor dimensions. At that time, Infineon was able to increase its 
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initial fr/fmax values of 190 GHz/250 GHz (B7HF200, [Boe04]) finally 
to 230 GHz/350 GHz while STMicroelectronics increased these FoMs from 
230 GHz/280 GHz (BiMOS9MW, [Ave09]) to 260 GHz/400 GHz. Later, 
both companies could demonstrate this performance level in a BiCMOS 
environment too. The fmax results obtained in the DOTFIVE project and 
in the recent past for the DPSA-SEG concept [Has14, Che14, Tril6] indicate 
that it is difficult to reach values beyond 400 GHz. The limited possibilities 
to decrease Rgx have been proved as the key bottleneck for advancements of 
the overall RF performance. Alternative concepts which were conceived to 
overcome this issue will be presented in the next subsection. 


1.3.1.2 Approaches to overcome limitations of the DPSA-SEG 
architecture 

If the geometry of a conventional DPSA-SEG SiGe HBT (see Figure 1.6(d)) 
is shrunk according to scaling scenarios for next technology nodes, as it was 
carried out in TCAD studies [Schl 1b, Sch17], there is no serious indication 
for an imminent end of performance progress compared to other technology 
approaches. In practice, however, we are confronted with the fact that the 
attempts to increase fmax did not go beyond values of 400 GHz while 
alternative concepts indeed surpassed this level. Unfortunately, the main 
reason for this deficiency is closely connected with the key advantage of the 
conventional DPSA-SEG process, namely the straightforward manufactur- 
ing of the link between the intrinsic transistor region and the base poly-Si 
layer. 

The vertical gap between the substrate surface and the base poly is 
bridged during the selective growth of the base layer (Figure 1.8(a)). Obvi- 
ously, the base layer and the p+ base poly are separated after base epitaxy 


Figure 1.8 Different base link configurations of selective epitaxial growth (SEG) HBTs. 
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by a higher ohmic region which has to be eliminated by B in-diffusion 
from the base poly. Additionally, it would be beneficial if dopants 
could be introduced in the un-doped Si cap layer beneath the emitter- 
base spacers. Several attempts have been made to solve this problem. 
For example, boronsilicate-glass EB-spacers and a dedicated anneal at 
800°C for 10 min were utilized by NEC [Sat92] to overcome this 
issue for one of the first DPSA-SEG technology developments deliver- 
ing fr/fmax values of about 50 GHz. STMicroelectronics tested soak 
annealing for a few seconds at 1,010°C to 1,040°C between base and 
emitter deposition for an advanced DPSA-SEG version [Can12]. An 
improvement of the fr/fmax/CML gate delay values from 300 GHz/ 
370 GHz/2.33 ps to 320 GHz/390 GHz/2.2 ps was shown for a split with 
additional annealing at 1,010°C in combination with a reduced B dose of 
the base layer which was applied to compensate for stronger B broadening 
compared to the reference process. 

Other concepts tried to include the region which surrounds the base 
layer for a lateral connection (Figure 1.8(b)) in addition to the standard 
configuration with a vertical link to the base poly. Such an approach could 
mitigate the effect of the decreasing contact area between the intrinsic base 
layer and the base poly-Si with ongoing lateral scaling. For this purpose, a 
second poly-Si layer was inserted in the layer stack enclosing the emitter 
window in [Was03]. An extra selective epi step was implemented in [Fox08] 
for positioning of the base poly in lateral direction related to the SiGe base. 
A simple and meanwhile widely used measure to increase the contact area 
between the base poly and the base layer is, to some extent, the introduction 
of 45° rotated substrates. In this way, the emitter windows are aligned 
along the <100> crystal orientation and the formation of unfavorable facets 
during epitaxial growth of the base is minimized. However, the progress of 
above mentioned approaches on RF performance was limited apart from the 
increased process complexity or the disadvantage of a higher thermal budget. 
In particular, these attempts did not achieve the progress that might be theo- 
retically possible for highly conductive mono-crystalline base-link region. In 
this respect, the HBT module with EBL (Figure 1.8(c)) represents an uncon- 
ventional SEG approach which brought substantial progress for decreasing 
the specific Rpx contribution without dampening the prospects for high fr 
values. 

A detailed description of the EBL fabrication process can be found in 
[Fox11]. The major steps are described below. Figure 1.9(a) shows a cross 
section after emitter formation. The emitter window was etched in a layer 
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Figure 1.9 Schematic cross sections of the EBL process flow after emitter structuring (a) 
and after selective growth of the EBL (b). 


stack consisting of a lower oxide, a sacrificial nitride, an upper oxide, and 
a top nitride layer that is already removed at the state of Figure 1.9(a). At 
this point, there are two specific features which differ from a conventional 
DPSA-SEG flow. First, a sacrificial nitride layer is deposited instead of the 
base poly-Si. Second, an overhang of the emitter poly-Si over the emitter 
window was created in a self-aligned manner by pulling back the upper oxide 
layer before EB spacer formation. Emitter poly-Si structuring is completed 
by chemical—mechanical polishing (CMP) similar to [Fox08]. The key idea 
to form the EBL is illustrated in Figure 1.9(b). After emitter CMP, the devices 
are covered by an oxide layer, and then the cover oxide and the upper oxide 
are patterned by a masked dry etching step before the sacrificial nitride is 
removed by wet etching. The resulting cavities are filled by SEG of B- 
doped Si as illustrated in Figure 1.9(b) followed by non-selective growth of 
B-doped Si. 
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In the first publication on this technology concept [Fox11] ft/fax/CML 
gate delay values of 300 GHz/480 GHz/1.9 ps were presented. However, a 
detailed comparison of the EBL HBT performance against standard DPSA- 
SEG results was not in the focus of this first demonstration. This assess- 
ment has been addressed in the DOTSEVEN project. The EBL HBT was 
compared directly with the conventional DPSA-SEG approach based on 
identical collector designs, transistor layouts, and measurement conditions. 
For this purpose, the EBL HBT module was implemented in Infineon’s 
0.13 wm BiCMOS environment which includes the standard collector con- 
cept of an epitaxially buried sub-collector and deep-trench (DT) isolation 
combined with a mm-wave Cu back-end-of-line (BEOL). In contrast, the 
original EBL process comprised IHP’s DT-free collector approach [Hei02] 
using STI-isolated, highly doped collector regions as well as an Al-based 
BEOL. 

For this exercise, EBL HBTs and conventional DPSA-SEG HBTs were 
compared in two ways. First, IHP manufactured its novel device on Infineon 
wafers in a bipolar-only flow. The joint fabrication started at Infineon by 
forming the buried sub-collector, the deep trench and STI, and the MOS gates 
(Table 1.1). Then, the wafer processing was continued at IHP with the EBL 
module. The CMOS fabrication steps after the HBT module, which could 
deteriorate the HBT performance, were skipped in these experiments. All 
process steps for the bipolar devices including the final activation annealing 
and salicidation were done in these runs at IHP. Compared to [Fox11], the 
emitter—base spacer process was slightly modified to assist the formation of 
reduced emitter widths. To eliminate the risk of poly-Si residues, an extra 
mask was introduced to remove the emitter poly-Si outside of the transistor 
regions before emitter planarization. Otherwise, we preserved the original 
EBL flow including the thermal treatment and doping of the SIC, SiGe base, 
and emitter. 

In a second cycle, the full BiCMOS flow was applied. The HBT fab- 
rication was finished at IHP with removing the CMOS protection layer. 
The further processing corresponded to Infineon’s 0.13-um BiCMOS pro- 
cess including low-doped-drain implantation and annealing, CMOS gate 
spacer deposition, source/drain implantation and annealing, and salicidation. 
Table 1.1 shows which process modules were carried out by Infineon and 
which by IHP for the bipolar-only flow and for the full BiCMOS process, 
respectively. 
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Table 1.1 Process modules done by Infineon and IHP for the bipolar-only runs (left) and for 


the full BiCMOS process (right) 


Bipolar-only BiCMOS 
Infineon IHP Infineon IHP 
Buried layer, Epi Buried layer, Epi 
Isolation Isolation 
Wells Wells 
Gate Gate 
CMOS protection CMOS protection 
SiGe HBT SiGe HBT 
LDD 
MOS spacers 
S/D 
Final anneal Final anneal 
Salicide Salicide 
IMD IMD 
Contact Contact 
BEOL BEOL 


To assist the BiCMOS integration, several adjustments of the EBL HBT 
module compared to the bipolar-only runs were made: 


e The number of SIC implants for the high-speed HBTs and subsequently 


the total dose was reduced. 


e The base profile thickness was slightly increased to make the profile 
more robust against the additional thermal budget caused by the CMOS 
integration. 

e The emitter doping was reduced and the Si-cap thickness of the SiGe 
base deposition was adjusted to compensate for potentially enhanced 
emitter diffusion due to CMOS integration. 

e The effective emitter width was slightly reduced using optimized pro- 
cesses for emitter window lithography and etching to support lateral 
scaling. 

e A new laterally scaled emitter-base spacer process was introduced to 
reduce the base link resistance. 


Figure 1.10 shows the resulting cross sections of the emitter-base complex 
in the bipolar-only process and the full BiCMOS runs. The effective emitter 
width amounts to 130 nm for the bipolar-only process and 120 nm in the 


BiCMOS flow. 


In the following, electrical properties of the joint Infineon/IHP HBT fab- 
rications, i.e., bipolar-only and BiCMOS EBL, are evaluated in comparison 
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Figure 1.10 TEM cross section of IHP’s EBL-HBT module on Infineon’s 130 nm platform 
for the bipolar-only run (left) and full BiCMOS flow (right). 


to those of Infineon’s DOTFIVE DPSA-SEG results [Che11] and of the 
THP reference [Fox11]. Static and dynamic parameters are summarized in 
Table 1.2. 

Concerning the emitter—-base (BV £go) and collector—base (BV cgo) break- 
down voltages, it should be noted that the collector—-base and base-emitter 
profiles of the reference EBL HBT [Fox11], and consequently also those 
of the joint bipolar-only runs, are more aggressively scaled than those of 
the DPSA reference transistor of Infineon [Che11]. Due to the modifications 
listed above, the corresponding profiles of the BiCMOS version are relaxed 
resulting in similar values of BVppo but also of the current density at peak fr 
compared to the Infineon reference. 

Now, we turn to the evaluation of the high-frequency behavior. For 
the determination of fr and fmax, OPEN and SHORT de-embedded 
S-parameters up to 50 GHz were used. fr and fmax are extrapolated from 
the small-signal current gain |92; |and the unilateral gain U, respectively, with 
—20 dB decay per frequency decade. Regarding the transistor layout, the 
focus will be on the double-base contact (BEBC) design because it has been 
proved superior in terms of high-frequency performance. For the Infineon ref- 
erence transistor only single-base contact (BEC) data are available [Che11]. 
Therefore, a BEC configuration of the joint bipolar-only preparation was 
included in Table 1.2 to facilitate the comparison with the conventional DPSA 
data. IHP’s reference device consists of an 8-emitter BEC configuration with 
comparatively short emitter lengths optimized for the used unconventional 
collector construction. 
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Figure 1.11 Transit frequency fr and maximum oscillation frequency fmax vs. the col- 
lector current density jc for Infineon/IHP EBL fabrication in a bipolar-only process and in a 
BiCMOS run vs. the IHP EBL reference. 


Figure 1.11 shows fr and fmax as a function of the collector-current 
density for transistors of the two Infineon/IHP EBL versions and of IHP’s 
reference preparation. Looking at the Infineon/IHP bipolar-only results, the 
high-frequency parameters (peak f'r/fmax values of 300 GHz/500 GHz at 
Vcp = 0.5 V for the BEBC device, and 305 GHz/465 GHz for the BEC 
transistor) represent a substantial progress compared to those of the BEC 
Infineon device (240 GHz/380 GHz) or to results of other DPSA processes 
[Che14, Tril6]. In general, these figures fit well to the data of IHP’s reference 
transistor, although the BEBC layout shows even a 55 GHz higher peak 
fmax whereas peak fr is 20 GHz lower compared to the IHP reference 
(Table 1.2). In the case of fmax, these deviations are explained by the lower 
Ccp of Infineon’s collector isolation scheme while in the case of fr, the 
lower Rc and Rry of the IHP transistor design have to be considered. Note 
that the IHP reference was re-measured at Infineon under company typical 
measuring conditions. The somewhat lower fmax (445 GHz [Fox15]; 480 
GHz [Fox11]) is primarily attributed to the changed extrapolation frequency 
(20 GHz [Fox15]; 38 GHz [Fox11]). 

One important factor for the enhanced RF performance of the HBTs from 
the joint bipolar-only process vs. that of the Infineon reference is the increase 
of fr by about 25% due to an advanced vertical doping profile. Nevertheless, 
the main effect of the EBL process on fmax is the marked reduction of 
Rpg relative to the Infineon reference value of a conventional DPSA-SEG 
process. Already for the BEC configuration of the EBL HBT, a decrease of 
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the length-specific input resistance (Rp + Rp) by 40% relative to the Infineon 
reference is observed. Similar relations are true also of the BEBC BiCMOS 
device. It should be stressed at this point again how important identical 
RF transistor layouts, measurement tools, and extraction procedures are for 
reliable evaluations. For example, (Rp + Rp) values given in Table 1.2 are 
extracted from yj, circle fit. This leads to 17% lower values compared to the 
procedure applied in [Fox 11] based on a circular fit of s11. 

Considering the high-frequency behavior of the HBTs, promising results 
were demonstrated with respect to a reduced base link resistance. However, 
the fr of 240 GHz realized for the EBL module within Infineon’s 130 nm 
BiCMOS platform is significantly below the ambitious targets for next SiGe 
HBT generation. Certainly, the effect of a higher thermal budget of the post- 
HBT BiCMOS steps at Infineon compared to those of the original IHP flow 
has to be considered. In addition, more aggressive EB and BC doping profiles 
have to be applied for further f'r enhancement. 

It remains the question whether there are architecture- or flow-related rea- 
sons which make it more difficult or impossible to approach the performance 
values presented in the section “Optimization towards 700 GHz fmax” for 
the NSEG HBT also with the EBL concept. However, the finally achieved 
enhanced high-frequency parameters of the NSEG HBT were paid partially 
with increased process complexity. The search for an HBT architecture and 
corresponding process flow which combine best performance and reliable, 
cost-effective processing is in this context of continuing interest. 


1.3.2 Non-selective Epitaxial Growth of the Base 


Non-selective epitaxial growth of the SiGe base layer is widely used 
in SiGe HBT fabrication. Examples are production processes of IBM/ 
Globalfoundries [Orn03, Pek14] and TowerJazz [Prel1] as well as several 
technology generations developed by IHP [Kno04, Hei07, Rue10] and by 
NXP and imec [Don07, Huy11]. These processes have in common a layer 
stack consisting of a Si buffer layer, a SiGe layer containing the boron-doped 
base, and a Si cap layer deposited across the whole wafer. This layer stack 
grows mono-crystalline in active HBT areas where the Si surface is exposed 
while it grows poly-crystalline in all other areas covered with oxide or nitride. 
This so-called differential growth mode is in contrast to the SEG where the 
deposition occurs only in the exposed Si regions. 

An implication of the non-selective growth mode is that the poly- 
crystalline layer which is grown on the isolation layers adjacent to the active 
HBT can be used to form the extrinsic base regions. Typically this approach 
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is combined with an additional ion implantation into the extrinsic base 
regions to enhance their conductivity. This approach was applied, e.g., in the 
0.25-um BiCMOS process SG25H1 of IHP which provides peak ft/fmax 
values of 180 GHz/220 GHz [Hei07]. In such an approach, the thickness of 
the extrinsic base is defined by the layer stack grown to form the intrinsic 
base. This limits the achievable sheet resistance of the extrinsic base and in 
particular the conductivity of the extrinsic base region below the poly-emitter 
overhang. That is why several approaches have been developed to enhance 
the conductivity of the extrinsic base region by deposition of additional Si 
layers. It turned out that those elevated extrinsic base regions were necessary 
for extending fmax of NSEG HBTs beyond 300 GHz. An NSEG process 
with elevated extrinsic base regions self-aligned to the emitter window is 
used, e.g., in IBM’s 90 nm SiGe BiCMOS technology [Pek14] exhibiting 
peak ft/fmax values of 300 GHz/360 GHz. In a variant of the process, the 
RF performance could be further improved to ft/fqmax of 285 GHz/475 GHz 
with the help of millisecond annealing [Liu14]. 

An alternative NSEG HBT process with elevated extrinsic base regions 
is used in IHP’s 130 nm BiCMOS technology [Rue10, Rue12]. This HBT 
concept was the starting point for the performance optimization toward 
500 GHz fmax performed in the DOTFIVE project. It turned out to be 
a promising concept for even further performance improvement in the 
DOTSEVEN project. In the following, we review the main features of the 
corresponding HBT process flow. The elevated extrinsic base regions are 
formed by an additional epitaxial step after emitter structuring as first pub- 
lished in [Rue03]. The implementation described below corresponds to the 
technology SG13G2 of IHP offering HBTs with fr/fmax/gate-delay values 
of 300 GHz/450 GHz/2.0 ps. 

A schematic cross section of the HBT is shown in Figure 1.12. Key device 
features are: (1) Elevated extrinsic base regions self-aligned to the emitter 
window resulting in low extrinsic base resistance Rpx. (2) Formation of the 
whole HBT structure in one active area without STI between emitter and 
collector contacts resulting in low collector resistance and small collector- 
substrate junction areas. (3) Device isolation without deep trenches resulting 
in reduced process complexity and improved heat dissipation. 

Different stages of the HBT process are illustrated in Figure 1.13. The 
fabrication of the HBT module begins with the formation of the collector 
regions by high-dose ion implantation. The collector regions are laterally con- 
fined by shallow trench regions without introducing additional deep trenches 
[Hei02]. Active collector regions are defined by deposition and patterning 
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Figure 1.12 Schematic cross section of an NSEG HBT with elevated extrinsic base regions. 


an oxide layer. The opened windows in this isolation oxide layer are filled 
by SEG of un-doped Si on the exposed collector areas. Next, selectively 
implanted collector (SIC) regions are formed via a patterned resist mask. 
A cross section of the HBT at this stage of the process is shown in Fig- 
ure 1.13(a). Now, the non-selective growth of the base is performed. The 
layer stack consists of a Si buffer layer, the SiGe:C base layer, and a Si 
cap layer. It grows mono-crystalline in active collector regions and poly- 
crystalline on top of the isolation oxide as indicated in Figure 1.13(b). After 
epitaxy, an oxide/nitride/oxide layer stack is deposited and emitter windows 
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Figure 1.13 Process sequence for the NSEG HBT with elevated base regions: (a) after SIC 
formation, (b) after non-selective growth of the base, (c) before emitter deposition, (d) after 
emitter structuring. 
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are structured. Additional inside spacers are formed before depositing and 
structuring the As-doped emitter. Figure 1.13(c) shows the device cross 
section before emitter deposition. The emitter is capped with a dielectric layer 
and structured via a patterned resist mask. Outside spacers are formed on 
the emitter resulting in the device structure shown in Figure 1.13(d). Next, the 
sacrificial nitride layer is removed by wet etching followed by the selective 
growth of the B-doped elevated extrinsic base regions. The fabrication of 
the HBT module is continued with the patterning of the base poly-Si layer 
via a further resist mask. After the described process sequence for HBT 
structuring, the devices are exposed to a final rapid thermal processing (RTP) 
step which is used in the BiCMOS flow for the activation of source and 
drain regions. In the reference technology SG13G2 a spike anneal at 1,050°C 
is applied for this purpose. Finally, cobalt salicidation is performed on all 
contact areas and the aluminum metallization is processed. A schematic cross 
section of the final HBT structure is depicted in Figure 1.12. 

The TEM cross section in Figure 1.14 shows an NSEG HBT with elevated 
extrinsic base regions from the SG13G2 BiCMOS process. The geometrical 
width of the emitter window of the final device is 120 nm. The elevated 
extrinsic base regions are separated from the emitter window by about 25 nm 
wide oxide spacers. This device construction facilitates the realization of 
very low extrinsic base resistances Rpx due to the self-aligned positioning 
of the elevated extrinsic base to the emitter window and the high conductivity 
of the crystalline regions of the extrinsic base near the emitter. However, 
it has to be noted that the NSEG flow presented here exhibits a signifi- 
cantly lower degree of self-alignment than the DPSA-SEG and EBL process 
flows discussed in the section “Selective Epitaxial Growth of the Base.” 


Figure 1.14 TEM cross section of an NSEG HBT of the technology SG13G2. 
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In particular, the collector window, the SIC implantation, the emitter window, 
and the emitter poly overhang are not self-aligned to each other. Their relative 
alignment is defined by the alignment accuracy of the respective lithographic 
mask steps. This sensitivity to the accuracy of the lithographic alignment can 
impose severe limitation for further scaling of lateral device dimensions. 

Regardless of the above-mentioned limitations of the described NSEG 
HBT process with respect to self-alignment and scalability, it served as 
workhorse for optimizing the HBT performance by IHP within the projects 
DOTFIVE and DOTSEVEN. This arose from the greater experience and 
familiarity with the NSEG HBT compared to concepts with selective epitaxy 
discussed above. The fabrication of the NSEG HBT in the SG13G2 BiCMOS 
process resulted from intensive optimization of this transistor concept within 
the DOTFIVE project [Heil0]. This raises the question if and by what means 
further potential performance improvements could be achieved. According 
to device simulation, there are still respectable reserves for speed increase 
[Schlla, Sch11b]. In particular, lateral scaling should help to increase fyyax 
further. In addition, an appropriate vertical scaling of the doping and Ge 
profile is required for the desired objective of balanced high fr and fax 
values. 


1.4 Optimization of the Vertical Doping Profile 


Optimized device architectures as well as lateral and vertical scaling con- 
tributed to noticeable progress for fmax over the last years. In contrast, the 
potential for improving f'r seemed to be limited, in particular, if high fyrax 
values have to be retained. All successful attempts to push the peak fr of SiGe 
HBTs beyond 350 GHz delivered rather low fmax values. For example, the 
SiGe HBT which demonstrated the highest fr until 2015 showed a peak fr 
value of 410 GHz together with a peak fmax value of 190 GHz [Gey08]. 

Within the DOTSEVEN project, we considered two directions toward 
HBT performance optimization. First, we focused on aggressive scaling of 
vertical HBT doping and Ge profiles for increased fy. Second, a device 
architecture and process flow with a balanced fr-fmax design at highest 
performance level was aimed for. In the following, we describe the main 
results of the experiments for f'r optimization. 

Various vertical doping and Ge profiles were investigated in a simplified 
HBT flow described in [Kor15]. In these experiments, a reduced thermal 
budget of the HBT process was utilized for limiting dopant diffusion. Lat- 
eral device dimensions were relaxed with respect to the reference process 
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SG13G2 in order to reduce process complexity. Measured fr and fax 
vs. collector current density are plotted in Figure 1.15 for the optimized 
vertical profile. The peak fr values could be increased to 430 GHz together 
with peak fmax values of 315 GHz [Kor15]. These results were achieved 
for an HBT with a BEC layout configuration sketched in Figure 1.16(a). 
This layout corresponds to the standard HBT configuration in the SG13G2 
reference process. It was optimized for the special collector construction 
without STI between collector contact and active emitter area. For better 
comparability with the results of 2D device simulations presented below, we 
have also investigated devices with CBEBC layout configuration sketched in 
Figure 1.16(b). 

A further objective of these investigations in the DOTSEVEN project was 
the assessment of the accuracy of theoretical performance predictions from 
state-of-the-art device simulations based on the comparison of simulated and 
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Figure 1.15 fr and fmax vs. collector current density for an HBT with BEC layout 
configuration. Eight HBTs in parallel with individual emitter areas of 0.17 x 1.01 um? were 
measured [Kor15]. 
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Figure 1.16 Layout configurations: (a) BEC configuration with base and collector contacts 
at the ends of the emitter line, (b) CBEBC configuration with base and collector contact rows 
parallel to the emitter line. 
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measured electrical data. For this purpose, it is essential to determine doping 
profiles, material compositions, and device geometries of the experimental 
reference structures as accurately as possible. Below, we summarize the 
results of the experimental profile characterization. 

A combination of various analytic techniques was used to characterize the 
vertical profiles including secondary ion mass spectroscopy (SIMS), X-ray 
diffraction (XRD), and energy dispersive X-ray spectroscopy (EDX). The 
impact of the thermal budget of the fabrication process on the B and Ge 
profiles of the HBT is shown in Figure 1.17(a). The major part of the observed 
profile broadening occurred during the final spike rapid thermal processing 
(RTP) with 1,030°C peak temperature. The B profile experienced only a 
moderate broadening due to the suppression of B diffusion by Ge and C. 
However, significant diffusion is observed for the Ge profile itself resulting 
in a reduction of the peak Ge concentration from 32 at % in the as-grown 
layer to 28 at % in the final structure. 

The accurate determination of doping and Ge profiles in actual HBT 
structures represents an additional challenge. Width and doping concentra- 
tions of epitaxial layers depend in general on the size of the exposed Si 
area. However, SIMS measurements require dimensions which are much 
larger than typical active HBT areas. We have performed EDX measurements 
of the Ge depth profiles in typical HBT structures and in large windows 
of 600 wm x 400 um which were also used for SIMS measurements 
(Figure 1.17(b)). The width of the epitaxial SiGe layer was found to be 14% 
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Figure 1.17 (a) Depth profiles of Ge (blue circles) and B (red squares) measured by SIMS. 
Open symbols are as-grown profiles. Filled symbols are profiles after the full fabrication 
process. (b) Ge depth profiles measured by EDX in a 600 um x 400 um window (blue) 
and in a typical HBT structure (green). 
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smaller in the small HBT window while about the same peak Ge concentra- 
tions were measured in both structures. The Ge profile of the small window 
can be obtained from the Ge profile in the SIMS monitor by shrinking the 
depth scale by 14%. We assume that the same shrink of the depth scale applies 
to the B profile resulting in a 14% thinner profile in the small HBT window. 

The measured emitter, base, and collector profiles of the final HBT 
structure are plotted in Figure 1.18. The theoretically proposed doping profile 
corresponding to generation N3 of [Sch17] was included in Figure 1.18 for 
comparison. This profile N3 was proposed for an HBT generation with peak 
fr values of about 500 GHz. The measured and the theoretical N3 profiles 
show similar widths of the base and of the EB and BC junctions. The steep 
increase of the theoretically proposed collector profile toward 102° cm7’ 
was not realized in the experiment due to limitations in the formation of 
low-defective high-dose SIC profiles. 

The extracted doping and Ge profiles were used as input for 1D device 
simulations with the Boltzmann transport equation (BTE) solver [Hon09] and 
for 2D simulations with the hydrodynamic (HD) transport model [Kor17]. For 
a quantitative comparison between simulation and experiment, all relevant 
features of the HBT must be represented adequately by the 2D structure. To 
accomplish this task, the widths of the EB and BC depletion regions were 
adjusted to meet the measured area-specific capacitances Cggj and Cgcj. The 
extent of boron diffusion from the external base as well as the lateral extent 
of the SIC were tailored in such a way that the measured edge capacitance 
CBEe and Cpce are reproduced [Kor17]. The 2D geometry of the simulated 
device was adjusted to the TEM cross section of the measured device. 
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Figure 1.18 SIMS profiles measured after the final annealing step. The theoretically 
proposed doping profile N3 of [Sch17] is shown for comparison. 
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Measured and simulated transit frequencies fy are plotted in 
Figure 1.19. At low and medium current densities, the simulated fy val- 
ues agree well with the measurement. However, at high Jc, the simulation 
markedly deviates from the measured curve. The degradation of fy starts at 
lower Jc and is less abrupt in the simulation, leading to an about 11% smaller 
peak fy. Further investigations are needed to clarify this deviation. Addition- 
ally, 1D HD and BTE simulations of the inner transistor were performed in 
[Kor17]. The simulated peak f'r of the 1D transistor is about 80% higher than 
the corresponding 2D value due to the absence of peripheral capacitances 
and resistances as well as self-heating. These simulation results indicate that 
a further enhancement of fy can be expected for the given vertical profile 
when contributions of the device edges to the EB and BC capacitances and 
parasitic resistances are reduced. 


1.5 Optimization towards 700 GHz fmax 


In this section, we review attempts in the DOTSEVEN project to optimize 
the device structure and the fabrication process of the NSEG HBT for highest 
RF performance. The starting point for this optimization was the SG13G2 
technology. The investigated process modifications addressed the reduction 
of device parasitics by reducing lateral device dimensions as well as by 
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Figure 1.19 Measured and simulated fr vs. collector current density [Kor17]. Simulations 
were performed with the hydrodynamic model in 2D and 1D. Results obtained from the 1D 
Boltzmann transport equation are shown for comparison. The measured device has an emitter 
area of 0.28 um x 5.0 um and the CBEBC layout corresponding to Figure 1.16(b). 
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improving the control of the doping profile and the conductivity of critical 
device regions. 

The possibilities for lateral scaling of the emitter window by lithographic 
measures were already largely exhausted in the SG13G2 technology due to 
the resolution limits of the DUV tool at IHP. A further challenge for down- 
scaling of the emitter window width is the fabrication of conformal inside 
spacers in narrow emitter windows. This concerns in particular the deposition 
of homogenous dielectric layers with good step coverage and the reactive ion 
etching with minimum damage of the Si surface. Finally, we had to learn 
from a series of development loops that the room for a well-controlled down- 
scaling of the emitter width was very limited within the current process flow. 
Starting from a value of 120 nm in SG13G2, the emitter width was reduced 
to 100 nm in an intermediate process variant (split CR2). For the final device 
optimization, an emitter width of 105 nm was realized. 

First, we discuss the process stage that was used for circuit fabrication 
in the DOTSEVEN project. After using the SG13G2 technology for a first 
circuit fabrication run, an HBT process with improved RF performance was 
targeted for a second circuit fabrication run (CR2). The following process 
changes were addressed in this split: smaller emitter—base spacers and smaller 
emitter window widths were formed by modifying the corresponding depo- 
sition and etching processes. An emitter deposition process with enhanced 
As concentration previously explored in [Heil0] was introduced. The doping 
concentration of the epitaxially elevated base link regions was enhanced 
and a new base profile was applied. In addition to these measures which 
are compatible with the SG13G2 BiCMOS process, we explored for further 
optimization of the HBT performance process changes which are in conflict 
with the reference BiCMOS flow. The thickness of the cobalt silicide was 
increased and the thickness of the silicide blocking spacer at the sidewall of 
the emitter poly-Si was reduced to minimize the external base resistance. The 
peak temperature of the final RTP step was reduced in order to minimize 
diffusion broadening of the base and consequently the base transit time. The 
HBT cross sections depicted in Figure 1.20 indicate the decreased emitter 
window width, the reduced widths of the base-emitter and silicide blocking 
spacers, and the enhanced CoSig thickness of the split CR2 with respect to 
the reference process SG13G2 (G2). 

Electrical device parameters for the investigated process variants are 
summarized in Table 1.3. The process developments introduced in the CR2 
split resulted in significant improvements of fr, fmax, and the CML ring 
oscillator gate delay compared to the reference process G2 (Table 1.3). 
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Figure 1.20 TEM cross sections of HBTs from the process splits G2 (a), CR2 (b), 
and D7s (c). 


Table 1.3 HBT parameters of different process splits 


Measuring 
Unit Condition G2 CR2 G2N G2NF_ D7 D7s 

No. emitters 8 8 8 8 8 8 
WE um 0.12 0.10 0.12 0.12 0.105 0.105 
lg um 1.02 1 1.02 1.02 1 1 
fr GHz 314 351 325 331 498 505 
fmax GHz Vop=0.25V, 414 526 461 510 671 720 

T =300K 
jc @ peak fr mA/um? 20.3 263 19 20.6 31.9 344 
Cec fF s-parameter 14.7 15.0 148 14.5 15:7 15.1 
CBE fF s-parameter 20.6 19.2 21 20.8 20.3 20.4 


Reduced emitter and base resistances of CR2 devices were facilitated by 
enhanced dopant concentration of the revised deposition processes for the 
emitter and the extrinsic base. For this analysis, we extracted the emitter 
resistance (Rg) from simple fly-back measurements [Get78]. The impact of 
the different process splits on base resistance (Rg) is assessed on the basis 
of Rg + Rg values extracted from circle fits of s11. The modified SiGe base 
epitaxy is an additional source for the improved RF parameters of the CR2 
split. The increase of the Ge content enabled higher collector current densities 
leading to higher fr values. A smaller base-emitter junction width and a 
reduced spike temperature of the final RTP created a more aggressive vertical 
profile compared to G2. Higher fr values were obtained but at the cost of a 
decreased base-emitter breakdown voltage BVggo. The reduced base sheet 
resistance Rgp; of the split CR2 supported a further reduction of the base 
resistance and higher fmax values. Figure 1.21 shows a comparison of fr 
and fmax as a function of collector current density for the splits G2 and CR2. 
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Figure 1.21 Transit frequency fr and maximum oscillation frequency fmax vs. collector 
current density for devices of the split CR2 (second circuit fabrication run in DOTSEVEN) 
compared to the reference process G2. Device dimensions are given in Table 1.3. 


Next, we investigated the potential of enhanced dopant activation by 
millisecond annealing and low-temperature BEOL processing for further 
performance improvement. Their benefit for SiGe HBTs has already been 
pointed out in [Liul4]. Millisecond annealing with laser or flash lamp 
techniques facilitates a very high level of dopant activation with almost 
no diffusion. In order to take full advantage of this high activation level, 
subsequent process steps have to be kept at sufficiently low temperatures 
to avoid dopant deactivation. Within the DOTSEVEN project, we exploited 
a non-commercial flash lamp annealing tool at the Helmholtz-Zentrum 
Dresden-Rossendorf, Germany. Peak temperatures beyond 1,200°C could be 
realized. A modified contact formation and nickel silicidation were applied 
to decrease the maximum temperature after flash annealing below 500°C. 
Cobalt silicide formation does not meet this requirement. In contrast, the 
nickel silicide fabrication widely used in advanced CMOS nodes does not 
need annealing above 500°C. 

The impact of these processes on HBT performance was investigated 
in [Heil6] by two process splits of the G2 process. For the split G2N 
(Table 1.3), the original cobalt silicide process was replaced by a nickel 
silicide process and the temperature of the contact formation process was 
reduced. In addition, a flash annealing step was introduced before silicidation 
in the split G2NF. Figure 1.22 demonstrates the impact of these two process 
modifications on fr and fmax. Markedly enhanced fmax values by 15% 
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Figure 1.22 Transit frequency fr and maximum oscillation frequency fmax vs. collector 
current density for devices of the split G2N (nickel silicide) and G2NF (nickel silicide and 
flash annealing) compared to the reference process G2 [Heil6]. Device dimensions are given 
in Table 1.3. 


and 23% compared to G2 were obtained for the G2N and G2NF process 
splits, respectively. These improvements are mainly attributed to reduced base 
and emitter resistances due to the high level of dopant activation supported 
by reduced dopant deactivation during silicidation and BEOL processing 
and by enhanced activation due to flash annealing. Breakdown voltages and 
junction capacitances are hardly affected by these process modifications (see 
Table 1.3). 

The final optimization stage performed in the DOTSEVEN project 
[Hein16] is represented by the split D7 in Table 1.3. The scaled device 
D7s corresponds to the same process flow. There, the width of the collector 
window is reduced by 17% and the width of the emitter poly-Si is reduced 
by 33% with respect to device D7 resulting in somewhat lower Rp and 
Cgc. A cross section of the final NSEG HBT is presented in Figure 1.20(c). 
Regarding the emitter-window and emitter—base spacer width, the situation 
is very similar to the interim case CR2. For this HBT, an emitter width of 
105 nm was determined. The geometric dimensions of the revised versions 
also suggest that limits were reached for further decrease of the width of the 
EB spacers and of the emitter-poly overlap to the emitter window. 

The D7 process split combines the above introduced process modifica- 
tions of an improved EB spacer process, an extrinsic base with enhanced 
conductivity, nickel silicidation, and flash annealing with the following addi- 
tional amendments. A further optimized SiGe base epitaxy is applied which 
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closely resembles the B and Ge profiles of the f-p-optimized device described 
in the section “Optimization of the Vertical Doping Profile.’ The thicknesses 
of the lower doped parts of the emitter—base and base—collector junctions are 
reduced with respect to split CR2. The emitter utilizes the enhanced doping 
level but now with reduced layer thickness. Furthermore, the fabrication 
process for the SIC was revised. In the G2 HBT flow, the SIC was formed 
with the help of a patterned resist mask. In the D7 split, this process sequence 
was replaced by a hard mask with inside spacers. By this means, we gained 
additional flexibility in matching the lateral dimensions of the SIC and the 
emitter window. The implantation dose of the SIC was doubled in the split 
D7 with respect to the reference process G2. 

Figure 1.23 illustrates the measurement procedure for the extraction of 
fr and fmax for the device D7s. The small-signal current gain hg, and 
the unilateral gain U were derived from s-parameter measurements up to 
50 GHz and plotted as a function of frequency. The fr and fmax values are 
obtained from averaged values around an extrapolation frequency of 40 GHz 
assuming a gain decay of —20 dB per frequency decade. The quality of the 
measurement procedure was confirmed by independent measurements at IHP 
and at Infineon as described in [Hei16]. 

Figure 1.24 shows fr and fmax values of the devices D7 and D7s as a 
function of collector current density. Data for the previous device generations 
G2 and CR2 are included for comparison. The optimized D7 process reveals 


1 10 100 1000 
Frequency (GHz) 


Figure 1.23 De-embedded small-signal current gain h21 and unilateral gain U vs. frequency 
of the device D7s used for extraction of transit frequency fr and maximum oscillation 
frequency fmax with —20 dB decay per frequency decade [Heil6]. The emitter area is 
8 x (0.105 x 1.0) pm’. 
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Figure 1.24 Transit frequency fr and maximum oscillation frequency fmax vs. collector 
current density for two device geometries (D7 and D7s) of the latest process status of 
DOTSEVEN compared to the reference process G2 and the split CR2 [Heil6]. Device 
dimensions are given in Table 1.3. 


a strong enhancement of both fr and fmax. The obtained high fp values are 
supported by reduced transit times for the aggressively scaled vertical doping 
profile and by strongly reduced Rpg values which are accompanied by a strong 
enhancement of the transconductance gm in the high current regime. Highest 
peak fr/fmax values of 505 GHz/720 GHz were measured for the scaled 
device D7s [Heil6]. Both of these values represent the state of the art for 
SiGe HBTs. 

CML ring oscillator gate delays are plotted in Figure 1.25 as a function 
of current per gate. The oscillators consist of 31 stages and a 1:16 frequency 
divider. Currents per stage were adjusted to a single-ended voltage swing AV 
of 300 mV at a supply voltage Vpn of —2.5 V. The circuits use conventional 
resistive loads and do not apply special circuit techniques such as inductive 
peaking. The data plotted in Figure 1.25 for four of the device splits discussed 
above demonstrate that the improvement of fy and fmax for the devices 
D7 and D7s is associated with significantly reduced gate delay times. The 
minimum gate delay of 1.34 ps for device D7s represents the shortest gate 
delay that has been reported so far for a SiGe HBT technology [Hei16]. 
Until now, shorter gate delays have not been reported for any other integrated 
circuit technology. 

The aggressive lateral and vertical scaling for the D7 split resulted in 
reduced base-emitter and base—collector breakdown voltages as indicated 
in Table 1.3. In addition, Figure 1.26 illustrates a degradation of the base 
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Figure 1.25 CML ring oscillator gate delays vs. current per gate for oscillators consisting 
of 31 stages with single-emitter HBTs for the splits G2 (Ag = 0.12 um x 1.02 um), CR2 
(Ag = 0.1 wm x 1.0 um), D7, and D7s (Ag = 0.105 um x 1.02 wm) [Heil 6]. 


= 
o 
T 
D 
© 


|,=const. 


20 


= 
Oo 
& 


Collector-Current Density (mA/um’) 


open baser 


10° 0 
0.4 06 08 1.0 0.0 1.0 2.0 


Base-Emitter Voltage (V) Collector-Emitter Voltage (V) 


Base, Collector Current Density (mA/um’) 
3 


Figure 1.26 Gummel characteristics (a) and base-current forced output characteristics (b) 
for the splits D7s and G2. Symbols in (b) indicate the bias points for peak fr. The emitter 
areas are 8 x (0.12 x 1.02) um? for G2 and 8 x (0.105 x 1.0) wm? for D7s [Hei16]. 


current ideality factor nyg and stronger self-heating for the device D7s. We 
suppose that a further optimization of the EB doping profile and process 
advancement for an improved alignment of the SIC to the collector window 
will weaken some of these drawbacks in future. In fact, excellent DC char- 
acteristics have already been demonstrated for devices with nearly 600 GHz 


FS Max in [Heil 6]. 
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1.6 Summary 


Due to the intensive work in the projects DOTFIVE and DOTSEVEN, a 
new high-speed performance level of SiGe HBTs has become a reality. Most 
valuable is the fact that not only fmax and the ring oscillator gate delay 
are improved significantly, but also a new f'y record is demonstrated for the 
same transistor. The balanced increase of fmax and fr toward 700 GHz and 
500 GHz, respectively, makes these devices attractive for an even wider range 
of RF, mm-wave and sub-mm-wave applications. 

These improvements were achieved for the most part by optimization of 
the vertical profile, the SIC formation, and the reduction of the external resis- 
tances. A lower thermal budget in the BEOL processing or its combination 
with millisecond annealing can produce an extra performance increase due to 
an enhanced level of dopant activation. Apart from variations of the collector 
window or the emitter poly-Si overlap, the potential of a comprehensive 
lateral scaling was not exhausted. Besides adequate lithographic capabilities, 
this requires advanced conformal deposition and damage-less etching tech- 
niques as well as well-controlled epitaxial processes for the SiGe base in 
small windows. 

The modifications applied here for pushing the HBT performance toward 
700 GHz fmax did not consider the compatibility to any frozen CMOS 
or BiCMOS process. Additionally, we did not shrink back from additional 
processing steps as long as noticeable performance enhancement seemed 
possible and process safety was ensured. Consequently, it is concluded from 
the present results that the feasibility of the DOTSEVEN device targets is 
demonstrated but the challenging task to implement these capabilities in a 
next BiCMOS production technology has still to be done. It is an enormous 
challenge to integrate these HBTs in an advanced CMOS process with its tight 
constraints on thermal budget and device topology while reaching a similar 
HBT performance level and fulfilling simultaneously the industrial needs of 
simplicity, robustness, and high yield. 

It was the intention of IHP and Infineon in the DOTSEVEN project 
to investigate whether the HBT module with EBL could be a promising 
candidate for this task. As a result, two major directions for next steps 
could be derived: (a) Transfer of performance enhancing modifications which 
were tested successfully in the NSEG concept to the SEG EBL flow and 
(b) revision of the flow to enable further down scaling of the emitter window 
and to improve process safety under a production-like BiCMOS environment. 
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The demonstration of a new SiGe HBT performance level should stim- 
ulate device engineers and technology developers to create further ideas for 
cost-effective process flows with best performance potential. In this context, 
technological challenges related to non-selective or selective base epitaxy 
and partial or full self-alignment of the HBT layers need to be reinvestigated 
under the process constraints of advanced CMOS technology nodes. 
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2.1 Numerical Simulation 
G. Wedel and M. Schroter 


Numerical simulation (including TCAD) tools were heavily used during 
DOTSEVEN, aiding the device design and optimization during process 
development and the physical understanding of the HBT operation so as 
to support compact modeling. In addition, based on earlier work during 
DOTFIVE on exploring the physical limitations of SiGe HBTs [Sch11], var- 
ious numerical simulation tools were used to create the first comprehensive 
and detailed roadmap in cooperation with the radio-frequency/analog mixed- 
signal committee of the International Technology Roadmap of Semiconduc- 
tors (ITRS)! consortium in 2014. The flowchart displayed in Figure 2.1 for 
finding the vertical and lateral HBT structure of a major ITRS technology 
node is an example for the large variety of simulation tools that were 


'In the course of the semiconductor industry consolidation only very few companies 
have remained that have the financial means for developing advanced CMOS technologies. 
Pursuing partially diverse manufacturing approaches for the associated MOSFET structures, 
the common basis for technology development, which initially led to the ITRS consortium, is 
gradually disappearing and rather results in a competition. Hence, the ITRS consortium has 
been abandoned by the CMOS industry in 2015. The performance tables of the ITRS have 
been taken over by the IRDS consortium. 
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developed and employed in DOTFIVE and in DOTSEVEN, since the same 
approach has been used to define the DOTSEVEN target HBT structure. As 
already mentioned in-house tools for carrier transport have been based on the 
DD, HD and BTE approach. In particular, a new deterministic BTE solver 
was developed and applied to SiGe HBTs, which is described in Section 2.2.2 
In addition, simulation tools were developed for the analysis of thermal and 
parasitic effects in advanced HBTs. 

A three-dimensional (3D) thermal solver was used for investigating the 
temperature distribution within the device structure and its impact on the HF 
characteristics. A more detailed analysis was performed and insights into the 
microscopic effects of self-heating were gained with a new Boltzmann solver 
for phonon transport which was then coupled to the already existing spherical 
harmonics expansion based BTE solver for charge carrier transport. This 
approach, which allows deeper insights into device reliability, is described 
in Section 2.3. 


estimate 1D FoM (fr) 
targets 


3D thermal analysis 
= thermal param. 


estimate 1D doping 


parameter extraction 
profile 


HICUM model card gen. 


1D device simulation 
(BTE, HD) 


sweep lateral 
dimensions 


modify ar 
chitecture 


perf. & param 
acceptable 


change 


y doping profile 


circuit simulation 
=> figures of merit 


2D device simulation 


=> perim., distrib. eff. 


balanced fmay/fr? 


‘ [ modify device design 
2D/3D electrostatics structure] feasible? 


= external parasitics 


major node found 


Figure 2.1 Flowchart for finding the vertical and lateral HBT structure of a major technology 
node [Sch17]. 
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In order to obtain a realistic estimate for the HF device performance, a 3D 
Poisson solver has been used to calculate the various parasitic capacitances 
within an actual 3D transistor structure. 


2.2 Device Simulation 
2.2.1 TCAD Device Optimization 


The device design of advanced SiGe HBTs demands an as accurate as 
possible prediction of the electrical behavior in order to shorten the time from 
process development of new devices to product and to identify promising 
device structures during the early stage of process development. Therefore, 
technology computer aided design (TCAD) software offers a relatively fast 
and cost effective approach, compared to fabricating and measuring test 
devices. Of course, the predictive capability of TCAD strongly depends on 
the accuracy of the employed physical models, such as those for carrier 
transport, and their parameters. 

The most widely used description for carrier transport is the so called 
drift-diffusion (DD) model, which has been the workhorse in industry for 
over 40 years. The DD transport model is derived from the Boltzmann 
transport equation (BTE) by taking the first moments [Jun03][Sel84][Lun00] 
and consists of the 

e Poisson equation, 
e hole and electron continuity equation and 
e carrier transport equations. 


Its major advantage is the low computational cost in terms of both memory 
requirements and simulation time. However, these advantages are obtained at 
the cost of physical accuracy, especially for today’s SiGe HBTs. For example, 
the DD transport model significantly underestimates the HF performance of 
advanced SiGe HBTs (i.e. too low transit frequencies fy) and overestimates 
the impact of the Avalanche effect (e.g. too low BVCEO). 

For improving the accuracy while maintaining reasonable simulation 
times, the hydrodynamic (HD) and the energy transport (ET) model were 
introduced. These models can also be derived from the BTE [Jun03][Lun00] 
and include a description of energy transport. The ET model neglects the 
momentum conservation and is thus a subset of HD model. The latter consists 
of the 


e Poisson equation, 
e hole and electron continuity equation, 
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e hole and electron transport equations, 
e hole and electron energy balance equations and 
e hole and electron energy flux equations. 


Compared to DD, the HD transport model considers two additional equations 
for each carrier type. In addition and due to the two additional equations, new 
physical quantities, like the carrier energy relaxation time (for the energy 
balance equation) and the thermal conductivity involved in the energy flux 
description are introduced. Another point in terms of the HD transport model 
is the impact of the carrier temperature (solution variable of the energy 
balance equation as a representation of the kinetic carrier energy) on the 
catrier transport equation. For the transport equation, the impact of the carrier 
temperature, especially its gradient acting as additional driving force, needs 
to be rather heuristically adjusted than set by physics-based considerations in 
order to obtain reasonable results w.r.t. measurements or BTE simulation data 
[Wed10]. Once the above mentioned issues have been clarified, a reasonable 
agreement between HD and BTE can be obtained in terms of terminal 
characteristics, like transfer currents or transit frequencies. In addition, the 
HD simulation times are only slightly longer (by a factor of 1.5 up to 3.5) than 
those of DD simulation. However, BTE results or, if available, measurement 
data are needed for adjusting the HD transport model. Thus for the design of 
advanced devices and realistic estimations of their performance, a BTE solver 
is inevitable. 

For solving the BTE, two kinds of methods exist: stochastic solvers based 
on the Monte-Carlo (MC) method [Jun03][Tom93][Jac83] and deterministic 
solvers [Hon11][Gal05][Wed16], which solve the BTE directly based on 
discretized equations just like in the DD and HD case. The MC method is 
more widely used, since it offers a relatively low implementation effort. In 
addition, the memory consumption is low compared to deterministic solvers. 
However, the major drawbacks of the MC method are long simulation times 
(since MC is inherently a transient method), noisy results (due to its stochastic 
nature) and a insufficient resolution of minority carrier densities, e.g. elec- 
trons in the base region of an HBT. These drawbacks and the advances in 
computer performance (faster CPUs and cheaper memory) gave rise to the 
development of deterministic solvers. These solvers enable the calculation 
of stationary solutions, which are smooth and noise free even for minority 
carriers. However, these advantages are obtained at the cost of a high memory 
consumption and especially a much more elaborate mathematical preprocess- 
ing and implementation effort. Despite the strong reduction in simulation time 
compared to the MC method, the computational effort still is significantly 
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higher compared to DD and HD. Thus, the use of even deterministic BTE 
solvers is practically not feasible for direct device design optimization, but 
remains restricted to the 1D carrier transport and serves mostly as a reference 
solution for advanced vertical HBT structures. 


2.2.2 Deterministic BTE Solvers 


The BTE is a seven-dimensional integro-differential equation defined over the 
three real space dimensions (x,y,z), the three dimensions over the reciprocal 
space (kz, ky, kz) and time t. In conservative form, the BTE for electrons reads 
[Jun03][Lun00][Honll][Gal05 ][Wed16] 

of” q 

p t Vz e (vof) — ava (B%f”) =C, (2.1) 
with the particle distribution function (PDF) f” as the solution variable of the 
BTE (with 0 < f”< 1), the carrier group velocity ig and the collision term 


C. The effective field EY cg 18 defined as [Hon11][Wed16] 
E? v 
=> E 
se = V3 ( w+ “cay -) (2.2) 


with the electrostatic potential 7 obtained by the Poisson equation, the 
material/composition dependent band edge £¢ o and the kinetic energy £” 
The variable v denotes the observed valley within the first Brillouin-zone 
of the reciprocal space. For the sake of readability, the dependencies of the 
quantities involved have been omitted and are here shortly summarized: 


e f” is a function of P, k X and time t; 
° W is a function of 7 and T A 

ewi i; a function of 7 and time g 

e Ee, o is a function of P; 


e <” is a function of 7 and T g 


The collision term C describes the carrier interaction (scattering) due to lattice 
vibrations (phonon scattering), impurities/alloys and other carriers. 

For each considered scattering mechanism, a transition rate SX (X is a 
placeholder for the considered scattering mechanism) is obtained by Fermi’s 
Golden Rule and the collision term becomes (see e.g. [Jun03][Lun00]) 


C= SR SEP USPS SS POR?) 


X Zo! 


(2.3) 
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where the summation over ro discards the spin degeneracy [Lun00]. The 
first term within the sums describes the in-scattering and the second one 
the out-scattering of particles, respectively. Equation (2.3) sums first over 
all possible ro (defined by the transition rate (S*)) from where in- or 
out-scattering might occur and sums these results up for each scattering 
mechanism. The formulation of the collision term (2.3) contains terms 
(1—f”” ), which measures the vacancy of the state Kes v respectively. Thus, 
in- or out-scattering might be blocked due to a fully occupied state, which 
is called Pauli-exclusion principle [Hon11]. For low doping concentrations, 
where the Fermi-level is sufficiently far away from the conduction band edge 
(non-degenerate semiconductor), the PDFs foe are much smaller than one. 
In this case, (2.3) can be simplified to 


re > =, 
CHESS SSR” > K”)? —SX(K° > k?) 2A) 

ah! 
However, in both Equations (2.3) and (2.4) it is summed over discrete final 


™ / 
states k” . Under the assumption that adjacent states are close enough, these 
states are assumed to be continuous and the sum is converted into an integral 
by the relation [Gal05][Wed16] 


Q >, 
Pigs Ss es I ee ere 25 
5 san | (2.5) 


ro 


with the crystal volume Q. Thus, with (2.5) the collision terms become 
[Hon 1 1][Gal05][Wed16] 


Q 1 1 1 
pad D + KY lee ea 
(2r) & 
1 — 1 
FPA- f ))dk” (2.6) 
for the degenerate case and 
Re 


C= a S [ET > KO)? — SX (R? > R” )fOdR” (2.7) 
X 


for the non-degenerate case. 
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Thus, the considered system of equations is set up by the BTE (2.1) 
with the effective field (2.2) and the collision terms (2.6) or (2.7). For 
2 muenca treatment, it is more advantageous to express the vectors 
k and E” in terms of the kinetic energy €” and £” , measured from 
the minimum of the valley v/ v, respectively. For carrier scattering, both 
momentum and energy conservation has to be fulfilled (see e.g. [Lun00]). 
However, due to the complex shape of the phonon energy as function of 


x — f 
the scattering vector = k” — k” and the demand of €” (k? ) = 


e*(R”) + hw B) + (Feo - E2 o) (energy conservation), approximations 
are employed for the phonon energy as function of the scattering vector, 
which relax the momentum conservation. In practice, modeling the scattering 
is mainly focused on the energy conservation, since a is approxi- 
mated to be either zero (elastic scattering) or constant (inelastic scattering). 
With these simplifications, it is equivalent to consider the kinetic energies. 
Thus for the BTE, a coordinate transformation has to be performed and 
the valley dispersion relation e2 (K”) needs to be a analytic and invertible 
function. The most commonly used dispersion relation is the non-parabolic 
one [Lun00][Tom93][Hon1 1][Gal05][Wed16] 


o > 


ev(1 + ae”) = | k ?|2, (2.8) 


~ 2mom* 
with the effective mass m* and the non-parabolicity factor œ. This dispersion 
relation models equi-energy surfaces in the reciprocal space as spheres (due 
to | k’|?), where a describes the increase of energy for increasing | k ”|. For 
a = Q, a parabolic dispersion relation is obtained, where the kinetic energy 
increases quadratically with | k ”|. With (2.8), the vector k” can be expressed 
in spherical coordinates [Hon11][Gal05][Wed16] 


)| v1- posle) |, (2.9) 
v1- p’sin(¢) 


with u as the cosine of the polar angle and the azimuthal angle y. In (2.9), 
the spherical coordinate system is rotated to measure the polar angle jz w.r.t. 
the k®-axis instead of the k¥-axis. This rotation is advantageous for 1D 
simulations in x-direction, since it allows to omit y due to the rotational 
symmetry of the PDF. Thus in this case, only two dimensions (€” and ju) 
have to be discretized instead of the full [kk kz] space [Honll][Wedl6]. 


H 
5 /2 * 
k” = ——_ Jer(1 + ae? 
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Nevertheless, the equi-energy surfaces do usually not exhibit a spherical 
shape within the first Brillouin-zone of the reciprocal space. For example 
in Si, ellipsoids aligned to the axis [k,k,k,] are found for the so called X- 
valleys, which mainly contribute to the carrier transport. Thus, the dispersion 
relation (2.8) becomes in this case 


h v\2 kv)? v\2 
en (hee!) ie (= + a + ve) ; (2.10) 
0 x y z 


with the anisotropic effective masses m}, my and mz. In order to account 
for the valley anisotropy in (2.9), the Herring-Vogt transformation [Her56] is 
employed, which basically scales the axis [k} ky kz] in such a way that the 
ellipsoids are mapped to spheres. After the mapping, the considered k-space 
[ky ky kz] is given by 


> => 

B= (BV e R”, (2.11) 
= 2 K 7 

c= —e Væl tae’) | V/1—p2cos(y) |, (2.12) 


LEES 
m* 


[TEV] = 0 mo D l (2.13) 
0 0 as 
and the effective mass my = ¢/mimjym2. Like for the isotropic dispersion 


relation (2.8) and its k-vectors (2.9), also for the anisotropic description 
(2.10) and the k-vectors (2.11)—(2.12), the ~-dependence can be omitted for 
1D simulations in x-direction, as long as the Herring-Vogt matrix does not 
contain off-diagonal elements. This assumption holds for the X-valleys in 
Si/SiGe, but not for the L-valleys in some I-V materials, like GaAs. If one 
considers the 1D transport in x-direction, the effective field E 2, consists only 
of a non-zero x-component. Thus, the PDF is only altered by the effective 
field in k®-direction. If the equi-energy surfaces are spheres or ellipsoids, 
aligned to the [k,k,k,] axis, their rotational symmetry allow to omit ¢. 


Otherwise, a change of the PDF in kł-direction forces changes in the k®, ke 
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and ke directions and the PDF does not exhibit a rotational symmetry on 
the equi-energy surfaces anymore. Focusing on Si/SiGe and considering the 
x-direction only, the BTE (1) simplifies to [Wed16] 


of +4 4 op TEV WV, wE fv) — za (rave ó Euf”) = C; (2.14) 
ot dx 
with the carrier group-velocity 
1 ar 
vg" = T a (i) (2.15) 


after the Herring-Vogt transformation and p A ” being the first main- 
diagonal element of the matrix (2.13). Assuming a spatial independent 
transformation matrix THV”, the BTE (2.14) transforms into the €” /pl- 
space to 
ors v q 0 v +2 v 
=C (2.16 


with the flux coefficients 


1 UTE” 
i= — (2.17) 
a |k”] 
deV 
dos” (e” v 
Gia E E a (2.18) 
a_|k?| 
deV 
1 — p?)dos”(e? v 
gpa CA e E (2.19) 
|k”| 
the abbreviation 
~ V2 *1 1+ 2ae” 
a |Pl= mom% + 2a€ (2.20) 
dev ho 2 /e(1 + ae) 
and the density of states 
Brae 1 2moms s 7 
dos” (e”) = O e¥(1+ ae”)(1+ 2ae”), (2.21) 


which can be viewed as the transformed infinitesimal volume element 


ae = dos” (e”)de"dudy. (2.22) 
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At this point, the analytic pre-considerations/pre-processing, necessary for 
the discretization, is almost done. There are various approaches, such as 
the spherical harmonics expansion (SHE) [Hon11] or BIM-WENO approach 
[Gal05][Wed16], which rely on the same underlying equations but differ 
in the numerical representation for f” (PDF). The choice of the ansatz for 
f” strongly affects the further treatment of the collision term. However, 
regardless of the SHE or BIM-WENO ansatz, the transformed BTE (2.16) 
is multiplied by the density of states (2.21) and integrated over a discretized 
control volume in the reciprocal space. After the numerical representation of 
the derivatives and integrals (collision term) involved, a set of equations (for 
each discretization point) is obtained, which finally results in a sparse matrix 
to be solved. The system to be solved also contains the Poisson equation, 
needed for the electrostatic potential involved in the effective field LE’, 
(2.2). The main burden of solving the BTE deterministically arises from the 
number of needed discretization points, especially for e”. The step size for 
g” has to be fine enough to capture all energy exchanges by phonons, since 
otherwise these scattering processes get smeared out by others and results in 
less accurate results. 


2.2.3 Drift-diffusion and Hydrodynamic Transport Models 


As mentioned at the beginning of this chapter, both the DD and the HD 
transport model can be derived from the BTE by the method of moments with 
some simplifications. The Poisson and the continuity equations are employed 
in both DD and HD analysis. The Poisson equation reads 


Vz: (€co&r Ve (Y)) = —a(p —n+ Np — Na), (2.23) 


with the relative material permittivity €+, the elementary charge q, and the 
donor and acceptor doping concentration Np and Na, respectively. The 
carrier continuity equation reads 


Ve (I) = -sloe (r+ 5) (2.24) 


with the carrier density c, the carrier current density J. and the net 
recombination rate R. 

The first difference between the DD and HD transport models is found 
for the carrier transport equation [Syn15] 


J. = qepe Ecen — qsgn (c) ue Vr VPC 


— sgn(c)ueckg la + log (*)) Vp Te (2.25) 


ir 
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where the first two terms on the r.h.s. of (2.25) represent drift and diffusion 
transport, and the last term considers the gradient of the carrier temperature 
Te acting as additional driving force for the carrier transport process. Here, pe 
is the carrier mobility, Vp. is the thermal voltage for the carrier temperature 
Te Nz is the effective density of states of electrons and holes (depending 
on c), nir is the reference intrinsic carrier density, and f'¢ is a HD transport 


model parameter. Finally, Eei is the effective field given by 
E cot = —V7(U nO (2.26) 


with the carrier band potential Vg. accounting for the impact of high-doping 
and material composition effects on the respective band edge. Equations 
(2.24)-(2.26) contain the function sgn, which is depends the carrier type: 


_ f—1, for electrons (c = n), 
sgn(c) = { 1, for holes (c = p). k an 


The DD transport model consists of Equations (2.23)-(2.26), with a carrier 
temperature equal to the lattice temperature (Te = Ty,). Thus, for isothermal 
DD simulations, the last term on the r.h.s. in (2.25) vanishes. In the case of 
HD simulations, the carrier temperature Te is obtained by the energy balance 
equation 


ð Owe 
Zwe = Je: Beet (V>- Se) Rw, 4 = 


(2.28) 


coll. 


with we = 3kpT c and the energy flux 


2 
Se =n G sr 7] (=) qcheV r Te + sgn(c) G + =) Dig. 
2 q 2 q 

(2.29) 
Here, the first term on the r.h.s. Represents contains the thermal conductivity 
after Wiedemann-Franz and the gradient of the carrier temperature (energy 
transport due to spatially different carrier temperatures), where the second 
term models the energy transport due to the carrier current density. Within 
(2.25) and (2.29), three parameters f*¢, ft° and f° are available in the HD 
case for adjusting the impact of the respective contributions. These parame- 
ters are usually not fixed and vary across technologies and generations in the 
same material [Wed10]. The parameters are usually obtained by adjusting the 
HD terminal behavior (such as transfer and output characteristics and transit 
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frequency) to those obtained by BTE simulations or, if available, to measured 
results. The simplicity of the DD and HD transport models demands an 
elaborate set of physical material models for the 


e carrier mobility (low and high field case, DD, HD), 
e recombination and generation (DD, HD), 

e band potential (DD, HD), and 

e energy relaxation time (HD only). 


Usually, the material models are developed and their parameters are adjusted 
to BTE simulations of bulk material, where a homogenous and infinitely large 
semiconductor is assumed. 


2.2.4 Simulation Examples 


The intention of this section is to give an rough impression about the applica- 
bility of DD and HD transport models compared to the BTE for SiGe HBTs. 
As an example, a one-dimensional (1D) SiGe HBT structure with fr ~ 630 
GHz [Paw09] is considered. The doping and Germanium profile is shown in 
Figure 2.2. 


2.2.4.1 DD simulation 
The DD transfer characteristic and the transit frequency for the considered 
device are shown in Figure 2.3 and compared with the results obtained by the 
deterministic BTE solver based on the spherical harmonics expansion (SHE) 
of the electron distribution function [Hon11]. 

Compared to the BTE results, a reasonable agreement for the transfer 
characteristic between DD transport and BTE is obtained up to the onset 
of collector high current effects. However, DD severely underestimates the 
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Figure 2.2 Net doping and Germanium profile of a SiGe HBT with fr = 630 GHz. 
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Figure 2.3 Transfer characteristic (left) and transit frequency (right) obtained from DD 
transport and BTE for the device of Figure 2.2. Vcg = 1 V. 


transit frequency fr. The main origin of the discrepancies is coming from 
the assumptions involved in the derivation of the DD transport model. For 
DD, it is assumed that the distribution function is in equilibrium with the 
lattice. Thus, any displacement of the distribution function (towards higher 
kinetic energies) is not directly taken into account, but indirectly by the elec- 
tron velocity versus electric field model. Usually, a saturation drift velocity 
limits the carrier velocity although the velocity obtained by BTE might in 
some regions exceed that saturation limit (velocity overshoot) as shown in 
Figure 2.4(a) for the peak fr range. Thus, DD predicts slower electrons and, 
for the same current, a higher electron density in the base-collector region 
(cf. Figure 2.4(b)). This results in a higher electron transit time and thus, 
compared to the BTE, a lower fr. The constant DD electron density within 
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Figure 2.4 Electron velocity and density obtained by DD and BTE simulation. 
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the BC space charge region (SCR) is the result of the saturation drift velocity 
used in the DD simulation. 

The discrepancies shown in Figures 2.3 and 2.4 are one example of the 
deficiency of DD transport models. Another one is the underestimation of the 
breakdown voltage. 


2.2.4.2 HD simulation 

Here, the most critical parameters are ftt, ft° and f° introduced in 
Section 2.2.3. These parameters have a significant impact on the terminal 
characteristics, which can even show a non-physical behavior. Usually, a 
parameter constellations can be found that gives HD simulation results close 
to the ones obtained by the BTE. For the SiGe HBT in Figure 2.2, the impact 
of each of the above parameters is illustrated below. Figure 2.5 shows transfer, 
transit frequency and output characteristic for different values of ftt, while 
the remaining parameters are kept at zero. 


(a) Vac=0.0V (b) Vac=0.0V 
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Figure 2.5 Illustration of the impact of f* on the transfer characteristic, transit frequency 
and output characteristics for f" =f 0; 
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With increasing values of f*4 both the collector current density and the 
transit frequency are decreasing, while the output conductance decreases and 
can even become negative and thus non-physical in this case (of constant 
Vse). This is caused by a too strong gradient of the carrier temperature acting 
as driving force of the electron current. Thus, although f*4 often requires 
larger values for adjusting fT peak it needs to be limited. Fortunately, with 
the parameter ft° the impact of f*¢ on the output characteristics can be 
damped as shown in Figure 2.6. With f*°, the thermal conductivity involved 
in the energy flux equation (2.29) is altered. The smaller f* the less energy 
is transported by the gradient of the carrier temperature. Thus, the carrier 
temperature profile becomes less smeared within the device, which in turn 
diminishes the impact of the carrier temperature gradient on the transport 
equation (2.25). With a increasing f*°, the collector current densities are 
increased along with the output conductance. 


(a) Vec=0.0V (b) Vec=0.0V 
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Figure 2.6 Illustration of the impact of f*° on the transfer characteristic, transit frequency 
and output characteristics at fo = f= 0. 
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Figure 2.7 Exemplary illustration of the impact of f° (f**= f*° = 0) on the transfer 
characteristic, transit frequency and output characteristic. 


The last HD transport model parameter f°° scales the energy transport 
due to the current flow (see (2.29)). It has an opposite effect on the current 
density compared to f°, as shown in Figure 2.7. However, it allows to adjust 
the peak value of transit frequency (fp peak): 

The main burden for meaningful HD simulations is the determination of 
a proper set of HD transport model parameters. The following adjustment 
strategy and range of values proved to be suitable for the simulation of 
advanced SiGe HBTs: 


e ftd is used for adjusting Jc(Vgg) and fr(0.7 < ftd < 2); 
e ft° prevents a negative output conductance (—2.25 < ftd < —1.75); 
e f° is usually around zero (—0.5 < f% < 0.5). 
Figure 2.8 compares the HD results obtained by the default HD model param- 


eter set taken from SDevice [Syn15] with those obtained after adjustment to 
BTE results. The DD results are also shown for comparison. 
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Figure 2.8 Transfer characteristic and transit frequency of the SiGe HBT in Figure 2.2 
obtained from HD simulation with adjusted HD transport model parameters and SDevice 
defaults [Syn15], compared to BTE and DD simulation results. 


Compared to DD, the HD transport model with the SDevice defaults gives 
already a good agreement for the transit frequency. However, the current 
densities are about twice as high as those obtained by the BTE and DD results. 
This discrepancy can be eliminated by adjusting the HD parameters to the 
BTE results. However, this agreement in the terminal characteristics is not 
reflected in the internal quantities (e.g. electron densities), which exhibit a 
different spatial dependence compared to the BTE results. In addition, there 
is no common set of HD parameters across technologies and generations in 
the same material. These parameters need to be readjusted for each major 
doping profile change in order to obtain reasonable results. Therefore, the 
computationally expensive BTE simulations are mandatory. In practice, it 
suffices though to simulate just the major technology nodes with the BTE 
and use those results to find the HD parameters for each node. With these 
parameters, HD simulations can then be applied for device optimization 
within a particular node [Wed10]. 


2.2.4.3 Effects beyond DD and HD transport 

Due to the assumptions and simplifications involved in the derivation of the 
DD and HD transport models, some physical effects can not be captured by 
them compared to BTE solutions. 

One assumption is the so-called single electron gas approximation 
[Blo70]. Here, the transport relevant electrons are assigned to one val- 
ley, which dominates the transport. In the case of silicon, the six valleys 
in A-direction (usually denoted by X-valleys) are combined to a single 
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Figure 2.9 (a) Net doping and Ge profile of the SiGe HBT given in [Sch11] and the 
corresponding (b) transfer characteristic and (c) transit frequency obtained from BTE and 
HD simulation. 


valley, which is energetically located at the conduction band edge. How- 
ever, for SiGe and for material under biaxial-compressive strain, two of 
the six A-valleys are differently influenced with increasing Ge content 
[Hon11][Wed16]. In this case, two A-valleys are energetically shifted to 
higher potential energies, while the remaining four A-valleys undergo a 
downward shift in the potential energy. Hence, two conduction band edges are 
forming, where the lower conduction band edge is associated with the four 
and the higher with the remaining two valleys, respectively. In the case of DD 
or HD simulations, only the 4-fold lower conduction band edge is considered, 
neglecting the higher 2-fold conduction band edge. Figure 2.9 shows the 
impact of the neglected 2-fold conduction band for the SiGe HBT presented 
in [Sch11], which represents the presently known physical limit of SiGe HBT 
technology. According to Figure 2.9, the HD simulations overestimate both 
the collector current density and the peak transit frequency by about a factor 
of two. These discrepancies are caused by the abrupt rising edge of the Ge 
profile and the neglect of the second conduction band edge. 

For clarification purposes, the band edges, valley occupancies and quasi- 
static electron densities at fT, peak as obtained by BTE simulations are shown 
in Figure 2.10. Due to the abrupt rising edge of the Germanium profile, the 
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Figure 2.10 (a) Band edges and Ge profile, (b) valley occupancy, and (c) electron density of 
the SiGe HBT shown in Figure 2.9 obtained by BTE simulation at the operating point of peak 
transit frequency. 


conduction band edge associated with the 2-fold A-valleys is energetically 
lifted up abruptly and thus forms an energy barrier (cf. Figure 2.10(a)). Only 
few high energetic electrons are able to overcome this barrier, while instead 
most of the electrons accumulate at the barrier (cf. Figure 2.10(b)). This leads 
to an additional charge, which reduces both the transit frequency and the 
transconductance [Hon11]. Obviously, the single electron gas approximation 
used in conventional DD and HD tools cannot capture this effect. 

In fabrication, a step Ge profile is unrealistic and a graded profile rather 
occurs. The impact of a graded Ge profile is sketched in Figure 2.11. With 
the graded Ge profile (Figure 2.11(a)), the 2-fold A-valley is continuously 
shifted to higher energies so that the electrons are now able to gradually 
transfer to the 4-fold A-valley, preventing an abrupt charge accumulation 
(Figure 2.11(c)). 
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Figure 2.11 (a) Band edges and graded Ge profile as well as the corresponding (b) valley 
occupancy and (c) electron density obtained by BTE simulation at the operating point of peak 
transit frequency. 


While the grading prevents a degradation of the transconductance, the 
higher quasi-static change of the electron density causes an increase of 
the total capacitance Crot connected to the base terminal. According to 
Figure 2.12(a), Cot increases by a factor of 1.76, whereas the transconduc- 
tance Jm improves by a factor of 2.57 (Figure 2.12(b)). Overall this leads to 
an increase of fp. According to 


=s (2.30) 


the higher improvement of the transconductance is resulting in an increase of 
fr by a factor of 1.46. 

Figure 2.13 shows the improvements in the collector current 
(Figure 2.13(b)) and in fr (Figure 2.13(c)) due to the graded Ge profile. 
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Figure 2.12 (a) Comparison of (a) the total 1D capacitance connected to the base node and 
its components and (b) the transconductance, obtained for the initial (abrupt) and the new 
(graded) SiGe HBT profile. 
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Figure 2.13 (a) Comparison of the initial and the new SiGe HBT profile with corresponding 
(b) transfer characteristic and (c) transit frequency, obtained by both BTE and HD simulation. 


The good agreement of the HD results the with those of the BTE can only 
be obtained by readjusting the HD transport model parameters, since the Ge 
grading constitutes a major profile change. 
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Another limitation of the DD and HD transport model is the assumed 
shape of the distribution function for deriving them. Conventionally, a Max- 
well-Boltzmann distribution is assumed over energy which, in the DD case, 
is assumed to be in equilibrium with the lattice or, in the HD case, has a 
modified decay over energy due the spatially dependent carrier temperature. 
However, in both cases a displacement of the distribution function off its 
equilibrium position is not considered. Contrary to DD and HD transport, the 
BTE is solved for the distribution function at each real-space point and over 
the reciprocal space and thus offers information about both the actual decay 
over energy and its displacement from the equilibrium position. Depending 
on the considered doping profile, different shapes of important characteristics 
can be obtained. 

As an example, the SiGe HBT N3 in [Sch17] and shown in Figure 2.14(a) 
is considered. According to the transfer characteristics and transit frequency 
in Figure 2.14(b), (c), the deviations between HD and BTE occur despite 
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Figure 2.14 (a) Doping and Germanium profile of a SiGe HBT and the corresponding (b) 
transfer characteristic and (c) transit frequency obtained from BTE and HD simulation. 
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Figure 2.15 Transconductance and total capacitance of the N3 SiGe HBT. 


adjusted HD transport model parameters. Especially for the transit frequency, 
the BTE predicts an spike-like increase to fT, BTE,pk Compared to HD trans- 
port. As explained below, the deviations between HD and BTE originate from 
the doping profile in conjunction with the doping dependent description of the 
bandgap narrowing and the subsequent shift of the conduction band edge. 

In Figure 2.15, the bias dependent total capacitance and the transconduc- 
tance of the N3 HBT are shown. According to (2.30), the spike-like increase 
in fr originates from the strong increase and dip of the transconductance. 

Since the electron transport within the device is dominated by the peak of 
the conduction band edge. The conduction band edge of the 4-fold A-valley 
and the contours of the corresponding electron distribution function for three 
different operating points are shown in Figure 2.16. Since the Pauli exclusion 
principle is not considered, the distribution function exhibits values larger 
than one (i.e. > 0 in Figure 2.16) near the conduction band edge in the highly 
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Figure 2.16 Conduction band edge versus location and superimposed contour lines of the 
(logarithm of the) electron distribution function of the 4-fold A-valley over energy within 
the emitter-base region for three operating points: (a) around fT,BTE,pk/2, (b) just before 
fT,BTE,pk> and (c) and just after fT,BTE,pk: 
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doped regions. As discussed before, the 4-fold A-valley carries by far most 
of the electrons and thus it is sufficient to focus on this valley only. 

According to Figure 2.16(a) and (b), the conduction band peak for the 
current range up to around fT BTE,pk is located at x = 19 nm, which coin- 
cides with the steep doping gradient at the BE junction (see Figure 2.14(a)). 
This decrease in conjunction with the commonly employed (doping depen- 
dent) bandgap narrowing model [Slo77] leads to a sudden increase of the 
bandgap and consequently to a bias independent conduction band barrier, 
which is for the considered doping profile approximately 2.5 nm wide with a 
barrier height of around 10 meV. Due to the bending of the conduction band 
at higher applied Vgp-voltages, a plateau like conduction band edge is seen 
prior to fT BTE,pk in Figure 2.16(b). At an operating point beyond fT BTE,pk 
(see Figure 2.16(c)), a potential well is forming between x = 13 nm, which 
corresponds to the transition from the low to the highly doped emitter region 
and the associated bandgap difference, and x = 19 nm. 

At current densities below fr BTE,pk, only high energetic electrons can 
overcome the conduction band peak at x = 19 nm, which corresponds to the 
classical function of the Vgg modulated conduction band barrier that blocks, 
in the absence of tunneling, low energetic electrons. With increasing Vpr 
this barrier decreases and is overcome by a larger fraction of electrons. The 
resulting higher average electron velocity increases the current and transcon- 
ductance and thus fr. Around fT ,BTE,pk, the bias independent conduction 
band peak x = 13 nm starts to get exposed and leads to a wider barrier 
region with a potential well enclosed. Beyond fT BTE,pk, this wider barrier is 
more efficient in blocking the low energetic carriers. In addition, carriers also 
accumulate in the potential well, thus causing a rapid decrease of fr. 

In the case of the HD transport model, the Boltzmann distribution func- 
tion is altered in its spread by the carrier temperature. But the behavior of low 
and high energetic electrons can still not be separated and thus the blocking 
effect of low energetic electrons around fT BTE,pk can not be captured. 
Figure 2.17(b) compares the assumed HD distribution function with the one 
obtained by the BTE solver. Compared to the BTE result, HD overestimates 
the low energetic and underestimates the high energetic electron popula- 
tion. Therefore, the hill observed in g,, in Figure 2.15 originating from 
the increased average electron velocity at the conduction band peak is not 
observed in the DD or HD results, which are compared in Figure 2.18(a). 
Thus, the spike-like increase in ft sre (see Figure 2.14(c)) can neither be 
reproduced by DD nor HD. 
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Figure 2.17 Electron distribution function within the 4-fold A-valley just below fr BrE,pk 
within the emitter-base region. (a) Contours with the arrow marking the position of the doping 
induced conduction band barrier. (b) Comparison of HD and BTE distribution function at the 
barrier. 
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Figure 2.18 Comparison of (a) the transconductances and (b) the total capacitance obtained 
by DD, HD and BTE simulation results. 


In addition and due to the missing energy separation, all DD or HD 
electrons participate to the charging of the capacitance, contrary to the BTE 
where the blocked low energetic carriers do not contribute. Thus, the DD and 
HD transport models are overestimating the total capacitance, as shown in 
Figure 2.18(b). A more detailed insight is given by Figure 2.19. Figure 2.19(a) 
compares the electron densities for the three considered operating points 
obtained by the BTE (lhs) and the HD (rhs) solver. Contrary to the HD 
results, only a slight variation of the electron densities within the lightly 
doped emitter is seen around fT BTE,pk due to the blocking of low energetic 
carriers. Since these blocked electrons do not participate to the charging of 
the dynamic capacitances, the quasi-static electron densities (dn/dVgp) in 
the lightly doped emitter region are lower compared to those obtained by HD 
(see Figure 2.19(b)). 
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Figure 2.19 Comparison of the BTE and HD electron densities obtained by (a) an DC and 
(b) an quasi-static analysis. In (b), also the quasi-static hole densities are shown. For (b), a 
different axis intercept is used compared to (a) in order to visualize the contributions to the 
emitter junction capacitance Cjni. 


In terms of the emitter depletion capacitance [Sch06] 


= mE ə 
Ciki = f AV; (n—p)| dz, (2.31) 
0 BE UBC 
where tmp = 2(dn/dVpp=dp/dVpr), the region of blocked electrons 
reduces the contributions to Cini from the regions before, due to the higher 
quasistatic hole density (see Figure 2.19(b), Ihs). Therefore, around fr»; the 
value of Cixi is overestimated in the HD results compared to the BTE results, 
as shown in Figure 2.20. 

The spike-like increase discussed above has not been measured on fabri- 
cated devices, possibly because this effect might either be weakened due to 
tunneling or be masked by the impact of peripheral and external elements. In 
addition, the fabricated doping concentrations so far just do not have such a 
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Figure 2.20 Comparison of Cri obtained by BTE and HD simulations. 


(a) (b) Vac=0.0V 
107! r 30 102 
“T 102 ] : 
z| i 
oC 1 
= 10” i qe 
£ i Sm io 
= 1918 i oO 
| i =- BTE (new) 
= 1017 — HD (new,adj.) 
-©- BTE (ini.) 
4016 o% 
9 1 0.8 0.85 0.9 0.95 
Veg /V 
Vgc=0.0V 
(c) 
800 
3 600 
io) 
lin 
€& 400 
HD (new,adj.) 
200 -© BTE (ini.) 
10° 10! 10? 
Jo/ 2 


Figure 2.21 (a) Doping profile with smoothed high to low transition in the emitter (new) and 


previous step-like profile (ini.). Corresponding terminal characteristics: (a) transfer current and 
(b) transit frequency. 


steep gradient. The latter hypothesis has been tested in Figure 2.21 showing 
a smoothed transition in the emitter doping (Figure 2.21(a)). This results in 
the disappearance of the previously observed peak fr overshoot for the BTE 
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Figure 2.22 Electron distribution function within the 4-fold A-valley at fr,BTE,pk within 
the emitter-base region. (a) Contours with the arrow marking the position of the doping 
induced conduction band barrier. (b) Comparison of HD and BTE distribution function at 
the barrier peak. 


solver and terminal characteristics that are fairly close to the HD results (after 
adjusting though the HD transport model parameters (f*¢, ft° and f°°). 

In Figure 2.22(a), the conduction band edge and the contour lines of 
the electron distribution function for the smoothed doping profile are shown 
within the BE region. The corresponding distribution function in Figure 
2.22(b) at the conduction band maximum at x = 17 nm shows no blocking 
of low energetic electrons and thus the HD and BTE results are approaching 
each other. 


2.2.4.4 Comparison with experimental data 

In the course of the DOTSEVEN project, a variety of experimental data of 
fabricated SiGe HBTs were evaluated. In order to evaluate the predictive 
capability of the TCAD infrastructure employed within the project, measured 
terminal characteristics, such as transfer current and fp characteristics, were 
compared with simulation results. Here, a SiGe HBT fabricated by IHP 
and shown in Figure 2.23(a) is considered. For the comparison with the 
1D simulation results, the measured data were deembedded by the external 
elements of the actual 3D transistor structure. This was accomplished by 
using the physics-based and geometry scalable properties of the HICUM/L2 
compact model and its parameters, which were extracted from measured 
data of transistors and special test structures [Paw17][Kor15]. A comparison 
between the respective 1D measurements with the HD and BTE simulation 
results is given in Figure 2.23(b), (c) for the transfer current and transit 
frequency in the absence of self-heating, the impact of which has already 
been accounted for during the parameter extraction. 
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Figure 2.23 (a) Doping and Ge profile of a SiGe HBT (fabricated by IHP) with correspond- 


ing (b) transfer current, and (c) transit frequency. Comparison HD and BTE simulation results 
with 1D measurement data. 


According to Figure 2.23(c), both HD and BTE simulation results agree 
well with the measured data around peak fr and beyond. Below peak fr, 
discrepancies exist which may be attributed to (i) too strong deembedding 
of parasitic or external junction capacitances in the measured data, (ii) an 
incorrect doping and Ge profile resulting in lower junction capacitances or 
doping and Ge dependent bandgap for the device simulation, (iii) the neglect 
of Carbon in the base region, or (iv) a significant difference in the doping, 
Ge and C dependent bandgap modeling in the simulation. The observed 
discrepancy in the transfer current (cf Figure 2.23(b)) indicates the latter as a 
major cause for the differences at low current densities. 

According to Figure 2.24, the experimentally determined bandgap nar- 
rowing is indicating the presence of metastable strain, as reported in [Bea92]. 
For a first estimation of the impact of different bandgap narrowing values 
as function of Germanium on the terminal characteristics, a simple linear 
model (“‘lin.” in Figure 2.24) is employed for DD and HD simulations. The 
corresponding terminal characteristics are shown in Figure 2.25. 
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Figure 2.24 Comparison of the Ge concentration induced bandgap narrowing from exper- 

imental data (exp.) and device simulation model (mod.). In addition, the lower and upper 


boundary (Ib and ub) for bandgap narrowing as function of Ge the presence of metastable 
strain [Bea92] is shown. 


(a) Von=1.5V (b) Vor—1.5V 
10? 500 
--- HD (mod.) 085 
10? 400  — BTE (SHE ./— 
~--HD (lin) 54 Q 
10° % 
e a 
—1 
2 foe HD (ned) = 20 O, K 
pe — BTE (SHF) O Ò; 
O 1D meas. Oo Fá % 
10-2 ERE te DD (lin.) i 100 rs x 
- --HD (lin.) a O1D meas. ‘ 
10-4 0 Wai 
0.6 0.7 0.8 0.9 10-1 10° 101 102 
VBe/V 


Jn) mA 
C/ am? 


Figure 2.25 Comparison of the 1D measurement data with DD, HD and BTE simulation 
results. For the DD and HD simulation, the linear model indicated in Figure 2.24 is used. The 
transit frequency obtained by DD is not shown, since its parameters have not been adjusted. 


According to Figure 2.25(a), the linear bandgap narrowing model 
improves the agreement between the simulated and experimental transfer 
characteristics at low current densities, but there is little improvement for 
fr (cf. Figure 2.25(b)). Since advanced SiGe HBTs exhibit a significant 
carbon content, which is has not been considered in the simulations, further 
investigations based on experimental data (variation of the Germanium and 
carbon contents) are needed to clarify the origin of the discrepancies. 

Nevertheless and focusing on trends only, the TCAD infrastructure 
employed in the DOTSEVEN project is capable of predicting performance 
trends correctly. Figure 2.26 displays the peak values of the measured 1D 
transit frequency for four different SiGe HBTs obtained from a process split. 
Here, both the quantitative and qualitative trend of the measurements is well 
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Figure 2.26 Comparison of the performance trends predicted by TCAD (HD and BTE 
simulation) with 1D measurement results (three samples) for a process split with four different 
SiGe HBTs (fabricated by HP). 


captured by HD and BTE simulation. For the SiGe HBT labeled by HBT 
#3, the BTE results are overestimating the peak transit frequency. However, 
as explained before, this overestimation is based on an improper doping 
profile description at the metallurgical emitter-base junction and thus due to 
an underestimation of low energetic electrons. 


2.3 Advanced Electro-thermal Simulation 
C. Jungemann and N. Rinaldi 
2.3.1 Carrier-Phonon System in SiGe HBTs 


Charge carriers are accelerated under high electric fields in semiconductor 
devices and gain high kinetic energies. Carriers with energies higher than 
60 meV scatter mainly with optical phonons. The optical phonons, which 
have a negligible group velocity, cannot participate in heat transport. Instead, 
they decay into long-wavelength acoustic phonons, which determine heat 
conduction in semiconductors. Since this decay is relatively slow, compared 
to the carrier-phonon interactions, a bottleneck for energy dissipation can 
occur, which results in a large number of hot optical phonons in high-field 
domains. The carrier—phonon and phonon-phonon interaction processes with 
their corresponding scattering time constants are illustrated in Figure 2.27 
[Pop06]. 

In order to investigate self-heating in ultra-scaled bipolar transistors 
precisely, we have to consider a coupled system of transport equations for 
electrons, holes, and phonons. For this goal, the coupling terms, which 
describe carrier-phonon interactions, must be modeled accurately. In fact, 
carriers gain energy from the electric field and diffuse several mean free paths 
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Figure 2.27 Thermal energy transport diagram in semiconductor devices. 


(tens of nanometers) before they lose their energy to the lattice. Therefore, the 
so-called Joule-heating term, which represents the energy that carriers receive 
from the electric field, is not appropriate to capture the spatial distribution 
of heat generation in submicron devices. In order to tackle this problem, 
an advanced hydrodynamic model has been proposed [Mus08] to describe 
heat generation and transport in sub-micron silicon devices. However, this 
approach is still based on a single averaged carrier temperature within the 
relaxation time approximation and does not account for the spectral informa- 
tion regarding the emitted phonons. Since inelastic carrier—-phonon scattering, 
which is described in detail by the scattering integral of the Boltzmann 
transport equation (BTE), causes heat generation, solving the BTE is the most 
accurate approach to study carrier—phonon interactions [Pop10]. 

The Monte Carlo (MC) method has been widely used to solve the BTE 
for electrons coupled with heat transport equations. The Fourier heat equation 
as the most elementary approach to consider heat conduction was used 
in [Zeb06, Sad10], which is not valid for sub-micron devices. In a CPU- 
efficient approach, a system of energy balance equations for both optical 
and acoustic phonons was extracted from the phonon BTEs. This system 
of equations coupled with an electron MC simulator, which was used to 
study self-heating in silicon-on-insulator devices, can describe the phonon 
bottleneck in thermal energy transport by distinguishing between optical and 
acoustic phonon temperatures [Ral08, Vas09]; however, it cannot capture all 
microscopic effects of non-equilibrium phonon transport due to the averaged 
phonon temperatures. Nghiem et al. [Ngh14] introduced recently an elec- 
trothermal simulator, which solves the BTE for both electrons and phonons 
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self-consistently. However, in their system of equations, the feedback to 
the electron system is the effective temperature extracted from the phonon 
distribution function. 

Despite great advances in understanding the physics of phonon transport 
using the MC method, the stochastic basis of this method hinders calculation 
of parameters with very small or slow variations. Alternatively, a determinis- 
tic approach based on spherical harmonics expansion (SHE) can be used to 
solve the BTE [Gnu93]. In this regard, Ramonas et al. [Ram15] presented a 
deterministic solution of a non-equilibrium bulk electron—phonon system for 
noise calculations. 

Most recently, Kamrani et al. [Kam17a] presented a SHE method for the 
coupled BTEs of electrons, holes, and phonons under stationary conditions 
in a SiGe HBT. Since it has been shown that carriers in SiGe lose their 
energy mainly by scattering with longitudinal optical (LO) phonons [Pop05, 
Nil2], they solved the phonon BTE only for the LO phonon mode, and used 
energy balance equations for the other optical and acoustic phonon modes. 
In addition, the reduction of the thermal conductivity in a SiGe HBT was 
accounted for by analytical models for the lattice thermal conductivity in a 
way that is consistent with empirical data. 


2.3.2 Deterministic and Self-consistent Electrothermal 
Simulation Approach 


In the framework of semi-classical transport theory, the kinetics of a non- 
equilibrium carrier-phonon system under stationary conditions is described 
by a coupled set of BTEs for the distribution functions of carriers (electrons/ 
holes) f¢/"(7, F) and phonons n (7, 7), defined on the position vector 


7, carrier and phonon wave vectors k and re respectively. To investigate 
the impact of hot LO phonons on carrier transport, the non-equilibrium 
distribution function of LO phonons must be obtained, while for the other 
phonon modes, equilibrium distribution functions, which are evaluated at 
averaged phonon temperatures of the optical Top and acoustic Tac phonon 
branches, can be assumed. In this case, the BTE for the charge carriers is 
written as: 


Lif} = QO {fn} +S {f, Top, Tae} (2.32) 


where L{f} is the free-streaming operator, QŁO {f, n} is the scatter- 
ing operator for inelastic interactions of carriers with non-equilibrium LO 
phonons, and S 4f, Top, Tac} denotes the scattering operator of all other 
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scattering mechanisms. The scattering term for non-equilibrium LO phonons 
is expressed as: 
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where Co (7) is the interaction constant, hwop is the constant energy for the 
dispersionless LO phonons, and the upper sign refers to phonon absorption 
and the lower to phonon emission. 
The BTE for non-equilibrium LO phonons is written as: 
=> 

ata east) +G {n, f} =0 (2.35) 
where the first term represents interactions between optical and acoustic 
phonons within the relaxation time (Top) approximation, neg (Tz) is the 
equilibrium phonon distribution function, which follows the Bose-Einstein 
statistics, Tj, is the lattice temperature which is equivalent to the averaged 
acoustic phonon temperature, and G° {n, f} is the phonon generation term 
given by: 


{n f}= ZY, J [wes T k Gn) 
2 a (es 7.n)| pe (7, k) dk 
(2.36) 
where W3” and Wem refer to the transition rate for phonon absorption and 
emission, respectively. 
To solve this coupled system of BTEs, all the terms have to be expanded 
into spherical harmonics, and the spherical coordinates of the g-space 
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(q, 0q, Pq) can be expressed based on the modulus of the wave vector and 
the angles of the initial and final carrier states by using the momentum 


conservation rule, k = rs + 7. 

In this simulation approach, non-equilibrium effects for the other phonon 
modes are accounted for by a coupled set of energy balance equations for 
the optical and acoustic phonon branches, where the energy loss rate due 
to inelastic carrier-phonon scattering is used as the heat generation term 
[Kam15]. Furthermore, the effects of Ge content [Pal04], doping profile 
[Lee12], and boundary scattering [Vas10] in the reduction of the lattice 
thermal conductivity, as the main parameter that models heat conduction by 
acoustic phonons, were considered via analytical models. 


2.3.3 Hot Phonon Effects in a Calibrated System 


To investigate non-equilibrium effects for the carrier-phonon system in 
bipolar transistors, a state-of-the-art toward-THz SiGe HBT fabricated by 
Infineon Technologies AG within the framework of the European DOTFIVE 
project with an emitter width of Wg = 0.13 wm and a length of Lg = 
2.73 um, and belonging to a technology development stage referred to as set 
#3 in [d’Al14], was used to extract the thermal resistance based on simple 
DC measurements. The extracted thermal resistance from measurements 
Ry = 6,800 K/W [d’Al16] results in a junction temperature increase of 
AT, = 38.5 K at Vgg = 0.9 V and Vcg = 1 V with Ic = 5.66 mA. 

To study self-heating in this device, 2-D electrothermal simulations were 
performed for the structure, which is partly shown in Figure 2.2, the dop- 
ing profiles of which were extracted by secondary ion mass spectrometry 
(SIMS). For these simulations, a self-consistent steady-state solution of the 
BTEs for electrons, holes, and LO phonons coupled with the energy balance 
equations was obtained. Due to minor uncertainty in the extracted Ge profile, 
a calibration of the Ge content by a few percent is used to reproduce the 
measured Ic at Tg = 300 K by simulation. Figure 2.28 (top) depicts the 
thermal conductivity over the 2-D SiGe HBT by considering the effect of 
Ge content, doping concentration, and boundary scattering at 300 K. 

Since the SHE solution of the BTEs for a 3-D real space is too CPU 
intensive, only a 2-D real space was used and some important parts of the 
structure for thermal conduction, such as metal layers, were neglected. To 
mimic the 3-D nature of the heat propagation in 2-D simulation, the thermal 
boundary conditions are adjusted to match the simulated average lattice tem- 
perature increase over the space-charge region of the base-emitter junction 
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Figure 2.28 (Top) Thermal conductivity in the 2-D SiGe HBT structure by taking into 


account the effect of Ge content, doping concentration, and boundary scattering at 300 K. 
(Bottom) Self-consistent lattice temperature at Vgs = 0.9 V and Vcr = 1 V. 


equal to the extracted junction temperature from measurements [dď’ A102]. 
Hence, a convective boundary condition at the emitter contact is considered, 
and the heat transfer coefficient is calibrated to obtain an average junction 
temperature increase of 38.5 K from the temperature distribution of the 
simulation at the corresponding bias conditions. In this simulation, Neumann 
(adiabatic) boundary conditions are considered for artificial boundaries, the 
base and collector contacts, whereas the bulk contact at the bottom of the 
substrate is set to a constant temperature of 300 K. Figure 2.8 (bottom) shows 
the self-consistent lattice temperature obtained from the energy balance 
equations. 

The spatial distribution of the Joule-heating term and the energy loss rate 
due to in-elastic carrier-phonon scattering are shown along the symmetry 
axis of the device in Figure 2.29. This figure depicts the distance that carriers 
have to travel before releasing their energy to the lattice; therefore, carriers 
receive energy from the high electric field at the collector—base junction, 
while they lose their energy via net phonon generation deep in the collector 
region. Moreover, Figure 2.29 shows that carriers are mainly scattered by 


2.3 Advanced Electro-thermal Simulation 91 


1911 
= ,| E iB C 
Te : 
g ' 
is} 1 
2 4 f — Joule-heat 
= — En (Total) 
S 9 — En (LO) 
Uv [j 
5 1 
z ; 
6 
a 


100 150 200 250 300 
y [nm] 
Figure 2.29 Profiles of the power densities received and dissipated by carriers which 


are calculated from Joule-heating and energy loss rate due to inelastic phonon scattering, 
respectively, along the symmetry axis of the HBT at Vgs = 0.9 V and Vor = 1 V. 


LO phonons which can lead to a strong deviation in the LO phonon distri- 
bution function with respect to the equilibrium value evaluated at the lattice 
temperature in the collector region which is shown in Figure 2.30 (top). 

In order to investigate the effect of hot LO phonons and to make a 
comparison with the lattice temperature, an effective temperature is extracted 
from the non-equilibrium LO phonon distribution function which is shown in 
Figure 2.30 (bottom). The higher value of the effective temperature for LO 
phonons with respect to the lattice temperature, in the collector region, refers 
to the so-called phonon energy bottleneck in thermal conduction obtained for 
Top = 2 ps [Pop10]. The equality of Ty, and Tep around the base-emitter junc- 
tion reveals the negligible effect of hot LO phonons on the collector current, 
because the temperature at this junction dominantly determines the impact of 
self-heating on the collector current increase. However, the large difference 
between these two temperatures in the collector region might influence some 
electrical phenomena, such as impact ionization (II) due to hot electrons, 
which occurs mainly deep in the collector region. To examine this possibility, 
the injected current due to electron II (Irr) was calculated in a simulation with 
high collector—base voltage Vcg = 2 V. A stronger phonon scattering due to 
hot LO phonons obtained from the full electrothermal simulation leads to a 
lower number of hot electrons. However, the reduction of the Jj; at the same 
collector current due to temperature increase is just a few percent. As a result, 
the impact of hot LO phonons on electron II is not very strong in this case. 
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Figure 2.30 (Top) The LO phonon distribution function (zeroth-order harmonic), and (bot- 
tom) lattice temperature and effective temperature for LO phonons, along the symmetry axis 
of the investigated HBT at Ver = 0.9 V and Vor = 1 V. 


100 


2.3.4 Thermal Resistance Extraction from the Simulated 
DC Characteristics 


To extract the thermal resistance by an approach similar to the experimental 
extraction method [d’ A114], the required DC characteristics were calculated 
and compared with measurement data (Figure 2.31). Figure 2.31 (top) dis- 
plays the Ic — Vpr characteristics of the HBT at different homogeneous 
temperatures, compared to experimental data measured under DC conditions 
at various thermo-chuck temperatures. In these voltage/current ranges, self- 
heating can be safely disregarded and the results are used for calculating the 
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Figure 2.31 (Top) Jc — Vex characteristics for different homogeneous temperatures at 
Var = 0.6 V. Solid lines show the isothermal simulation results at Tg = 300, 320, 340, 360 K 
and symbols show the corresponding measurement data. (Bottom) Vaz — Vcg characteristics 
from electrothermal simulation and measurement at Tg = 2 mA. 


temperature coefficient y = — (OVgR/OTpR) |r. The extraction of the other 
parameter y = (OVpE/OVcp) |7, can be troublesome with the MC method, 
because the variation of Vgg with respect to Vcg for a constant Ip is very 
small. Moreover, thermal parameters of the device determine the slope of the 
Ver — Vop curve [Kam15]; consequently, a self-consistent electrothermal 
simulation is needed to evaluate the slope of this curve consistent with 
the lattice temperature distribution of the device. Figure 2.32 shows the 
Ver — Vcpg characteristics obtained from the deterministic and self-consistent 
electrothermal simulator in comparison with the measurement data. 
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Figure 2.32 Jc — Vse characteristics with and without including self-heating compared to 
measurement data at V cg = 1 V. 


Since the simulated DC characteristics are in very good agreement with 
measurement results, the extracted thermal resistance from electrothermal 
simulations Rpy = —y/(Igy) matches the value obtained from measure- 
ments. This confirms the consistency of the extracted junction temperature 
from the simulated DC characteristics with the average lattice temperature 
over the base-emitter junction observed in the temperature profile shown 
in Figure 2.30 (bottom). Therefore, this result attests the accuracy of the 
analytical model on which the experimental procedure is based. 

Figure 2.32 shows the effect of self-heating on the collector current 
increase observed in Ic — Vgg characteristics of the SiGe HBT by electro- 
thermal SHE simulations, which matches measurement results very well. 


2.4 Microscopic Simulation of Hot-carrier Degradation 


2.4.1 Physics of Hot-carrier Degradation 


Due to inevitable trade-offs in the performance optimization of SiGe HBTs, 
these devices are operated closer and even beyond the classical safe-operating 
area (SOA) borders. Hot-carrier degradation (HCD) is the main reliability 
concern in bipolar transistors that strongly limits the lifetime of a device 
operated close to the SOA limit [Fis15]. This degradation happens due to 
trap states generated by hot-carriers along the oxide interfaces over time. 
In general, imperfections at the Si/SiO2 interface lead to silicon dangling 
bonds, which can capture electrons or holes. Hence, these dangling bonds 
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Si dangling bond 


Figure 2.33 (Left) Creation of Si dangling bonds at the Si/SiOz2 interface. (Right) Passiva- 
tion of the dangling bonds by incorporating hydrogen atoms. 


are intentionally passivated by incorporating hydrogen atoms (Figure 2.33). 
However, hot carriers can supply enough energy to break the passivated Si- 
H bonds. A hot-carrier is a charge carrier which is accelerated under a high 
electric field inside the device and attains significant kinetic energy (higher 
than 1.5 eV) to break the bonds directly. Therefore, devices operating under 
bias conditions, which produce large electric fields, are susceptible to the 
HCD phenomenon. 

In high-performance HBTs, shallow trench isolation (STI) and deep 
trench isolation (DTI) schemes together with the emitter-base (EB) spacer 
oxide are used to reduce parasitic capacitances and leakage currents [Mar09]. 
However, trap states resulting from the Si-H bond dissociation at the EB 
spacer and STI oxide interfaces produce excess non-ideal base current via 
Shockley—Read—Hall (SRH) recombination in the forward mode and reverse 
mode, respectively (Figure 2.34). As a result, traps generated due to hot 
carriers along the EB spacer oxide interface degrade the main parameters of 


EB spacer oxide 


Collector 


Figure 2.34 Schematic of a state-of-the-art SiGe HBT with the corresponding EB spacer 
and STI oxides. 
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the device such as current gain and noise figure [Cre04]. Hence, a profound 
knowledge of the microscopic mechanisms of the interface trap generation 
and annihilation as well as their impact on the electrical characteristics is 
essential. 

Although conventional methods for a physics-based investigation of HCD 
in bipolar transistors, which are based on the lucky electron model, are 
electric field driven [Che09, Moe12, Wie16], it has been shown that the 
trap generation rate at the oxide interface is determined by the energy of the 
interacting charge carriers [DiM89, DiMO1]. Hence, another quantity called 
the acceleration integral (AI), which is calculated from the carrier energy 
distribution function (EDF), has been introduced to describe accurately the 
spatial distribution of the interface traps obtained from charge pumping 
measurement data [Stal1, Stal2]. In consequence, an energy-driven 
paradigm based on the AI has been developed to model both single- and 
multiple-carrier processes of the bond dissociation in the degradation analysis 
of the n-channel MOSFETs [Bin14, Sha15, Tya16]. 

This model has been recently extended to include the effects of both hot 
electrons and hot holes for describing the underlying mechanisms of HCD in 
bipolar transistors [Kam16, Kam17b]. For this purpose, a coupled system 
of BTEs for electrons and holes, which accounts for II and SRH, has to 
be solved. Since stochastic algorithms such as the MC method impose an 
enormous computational burden to resolve the high-energy tail of the EDF, a 
deterministic approach based on a SHE was used to solve the BTEs including 
full band structure effects [Hon1 1]. 

The reaction-diffusion model has been widely used to represent the com- 
plex dynamics of the trap generation and subsequent annihilation in HCD 
of bipolar transistors and negative-bias temperature-instability degradation 
of MOSFETs [Moe12, Rag15, Kuf07]. Despite a very good matching for a 
wide range of experimental observations, it has been shown that the reaction- 
diffusion model is inconsistent with the measurement data at the microscopic 
level [17]. Therefore, in [Kam17b] a degradation model based on the Als 
was used to calculate the dispersive bond-breakage rates associated with a 
reaction-limited model to describe HCD effects in a SiGe HBT. 


2.4.2 Modeling of Hot-carrier Effects 


In an energy-driven framework, the bond-breakage rate is modeled by con- 
sidering the interaction of the incident charge carriers with the passivated 
Si-H bond. A Si-H bond is represented as a truncated harmonic oscillator 
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AB-process 


Figure 2.35 The energy configuration of the Si-H bond modeled as a truncated harmonic 
oscillator. 


characterized by a system of eigenstate energies [Sto98], which is depicted in 
Figure 2.35. 

Bond dissociation occurs via excitation of one of the bonding electrons 
to the transport state, which is known as an antibonding (AB) process. As a 
result, a repulsive force is induced, which detaches the hydrogen atom. The 
dissociation rate from the zth state of the Si-H bond with the energy Ej, 
triggered either by a hot electron or by a hot hole, is given by: 


Rapa = 4p; + T} Bitur exp (—(Ea — Eord — Ei)/kgTo) (2.37) 


where I$ p; and [ h p; are the AB acceleration integrals of electrons and 
holes, respectively, v, is an attempt frequency, and E, is the bond-breakage 
activation energy, which is reduced due to the interaction of the bond dipole 
moment d with the oxide electric field E,,. Furthermore, to account for the 
fluctuations of the activation energy, a Gaussian distribution is considered 
with a mean value and standard deviation of (Ea) and oz, respectively. The 
AI for the AB process is given by [Bin14] 


oO 
risum oa f [DA (Eug (DIE - Eing)/Eres?] aE 
o (2.38) 
where Fini = Ea — Eoxd — F; is a threshold energy for the ith level, cae 
is the AB reaction cross section, f°/” (E) is the carrier distribution function, 
z°/h (E) is the carrier density of states, vol i (E) is the carrier group velocity, 
p = 11 is an empirical parameter, and Epef = leV. 
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If a charge carrier does not provide enough energy to trigger the AB 
mechanism, it can still contribute to the bond-breakage procedure via multiple 
vibrational excitation (MVE) of the bond. In an accumulative consideration 
of the MVE and AB mechanisms, the bonding electron can be firstly excited 
by several colder particles to an intermediate energy level, and then disso- 
ciated by a carrier with a relatively high energy. The bond excitation and 
deexcitation rates triggered by either a cold electron or a cold hole, are given, 
respectively, by 


Pu =Tnye + Tye + We exp (—hw/kpTo) (2.39) 


Pa =WTigyeg tive + Wwe (2.40) 
where we is the reciprocal phonon life-time and Aw is the energy distance 
between the Si-H energy levels. The AI for the MVE process is defined as 


Tin = 0YE pc [8° (B) 241" (B) of!” (E) KE — fw) /Exeg]] dE 


(2.41) 
The cumulative bond-breakage rate, which accounts for all possible superpo- 
sitions of the AB and MVE mechanisms, is calculated as 


ke y. Rap,i( $) (2.42) 


where k is a normalization prefactor defined as k = >, (P,,/ Pi)’. 
In the reaction-limited approach, the rate equation for the generation and 
recombination of the interface trap states is written as [Jep77] 


ONz/Ot = (No — Nit) Ra — Nit? Rp (2.43) 


where N; is the density of the generated interface traps, No is the density of 
the primary passivated Si—H bonds, and Rọ is the recovery rate. 

In this approach, the dispersion of the bond-breakage energy deter- 
mines the power-law time dependence of the HCD results. The dispersive 
effect of Ea is incorporated by discretizing an energy grid in the range of 
(Ea) — 30 g, (Ea) + 30g] and evaluating Nit for each discretization point. 
The interface trap density profile, which is the combination of every single 
defect, is obtained by calculating the average of Nit at each energy point 
weighted by the Gaussian distribution [Bin14]. 

To obtain the required distribution functions of the carriers, a coupled 
system of the BTEs for electrons and holes has to be solved. The BTE for 
electrons in the stationary case is written as: 


LAF = Stfy+O{fe, fey Tee, f} (2.44) 
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where S {f°} is the scattering operator, which accounts for carrier-phonon 
scattering, impurity scattering, alloy scattering, and II scattering of primary 
particles, Q { IET ny is the generation operator of secondary particles due 
to II [Jab14], and T° { tS. f h) is the SRH recombination operator [Jun07, 
Rup16] defined on the boundary of the oxide interface. 

In this simulation approach, a full-band SHE simulator is used to obtain 
the carrier EDFs for a SiGe HBT under stress conditions. Then, the Niz (7, t) 
profile calculated at each stress time step is fed into the SHE solver to 
calculate the characteristics of the degraded device which change due to SRH 
recombination. 


2.4.3 Simulation of SiGe HBTs under Stress Conditions Close 
to the SOA Limit 


The conventional mixed-mode (MM) stress conditions, which are the concur- 
rent applications of a high collector-base voltage and a high collector current 
density to accelerate the degradation procedure, set an upper limit for HCD of 
the SiGe HBTs during RF operation [Fis15]. However, as the main drawback 
they are far from typical operating conditions. Hence, to study the physics 
behind the long-term base current degradation under more practical operating 
conditions, three stress bias conditions P1, P2, and P3, along the border of 
the SOA, were selected to degrade the device up to 1,000 h at 300 K, and 
Gummel plots (Vcg = 0 V) were measured at certain stress time intervals 
[Jac15]. The corresponding bias voltage, current, and junction-to-ambient 
temperature increase obtained from the extracted thermal resistance Rypy = 
2,850 K/W [d’Al14] are summarized in Table 2.1. Measurements showed 
that the examined npn SiGe HBT is negligibly affected by stress at P1, and 
P3 exhibits a higher base current degradation over time in comparison to P2. 

For numerical simulations, a 2-D structure, the doping profiles of which 
were extracted from SIMS, was used. As a first step of this analysis, the 
simulator has to be calibrated to reproduce the measured Gummel plot and 
Ig — Vcr characteristics of the fresh device. The base current reversal at 


Table 2.1 Definition of the stress bias conditions P1, P2, and P3 and their corresponding 
junction temperatures 


Pl P2 P3 
Vor [V] 1 (<BVceEo) 2 (>BVceo) 3 (>BVcEo) 
Jo [mA/um?] 10 5 1 


AT; [K] 37 37 11 
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Vcr >BVcro due to avalanche multiplication of carriers is used as a basis to 
determine the II rates initiated by primary electrons and holes. Figure 2.36 
shows the II generation rates at P3 due to electrons and holes separately. 
Under this stress condition, the high electric fields within the collector— 
base junction accelerate electrons to reach enough energy required for initi- 
ating avalanche generation of electron-hole pairs via II. Figure 2.36 shows 
that hot electrons responsible for II are deep in the collector region while hot 
holes are mainly found in the base region. To obtain a better understanding 
of the energy of the carriers which participate in the degradation process, 
Figure 2.11 depicts a cut of the EDFs for electrons and holes along the 
symmetry axis of the investigated SiGe HBT with respect to kinetic energies. 
Electrons move toward the collector region and gain sufficiently high 
energies to initiate II, whereas the holes generated by II in the collector due 
to hot electrons [Figure 2.36 (top)] are accelerated toward the base and gain 
a lot of energy. Due to this high energy, some of the holes can shoot through 
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Figure 2.36 II generation rates in the SiGe HBT induced by electrons (top) and holes 
(bottom) at P3. 
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Figure 2.37 Cut of EDFs [eV~' cm™°] for electrons (top) and holes (bottom) along the 
symmetry axis of the HBT at P3. 


the base into the emitter, where they still have a relatively large energy 
[Figure 2.37 (bottom)]. A certain fraction of these hot holes hit the EB spacer 
oxide interface, where they might break Si—H bonds. 

This effect can only be captured by a model which resolves the depen- 
dence of the carriers on energy. Thus, it is not possible to directly describe the 
behavior of the hot holes with a drift-diffusion or a hydrodynamic model, in 
which the hole gas in the base is assumed to be close to equilibrium. Unavoid- 
ably, to perform physics-based degradation analysis relying on the classical 
models, the probability of hot carrier creation has to be calculated via the 
lucky electron model. This calculation based on the effective electric field 
shows inaccurately that hot holes are found at the collector—base junction. 
Subsequently, the possibility that a hot carrier reaches the oxide interface 
without any deflection has to be separately estimated [Moe12]. 

The interface traps located within the EB space-charge-region have the 
highest impact on the forward mode base current degradation via SRH 
recombination. Therefore, the EDFs at the intersection of the EB junc- 
tion and the oxide interface are compared for different stress conditions in 
Figure 2.38 (top). The negligible role of electrons in the degradation process 
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Figure 2.38 (Top) EDFs of electrons (dashed lines) and holes (solid lines) at the intersection 
of the EB spacer oxide interface and the EB junction [denoted by node X in Figure 2.36 
(Bottom)]. Profiles of the AB Als for electrons (dashed lines) and holes (solid lines) along the 
EB spacer oxide interface from node A to C denoted in Figure 2.36. 


is concluded from the EDFs for electrons at P1 and P2, which exactly follow 
the equilibrium EDF, and at P3, with a low-energy hump. Because of small 
collector—emitter voltage in P1, even holes do not gain high energies, which 
explains the negligible degradation rate at P1 observed in measurements. 
Furthermore, the high-energy tails of the hole EDFs at P2 and P3 reveal 
the dominant role of hot holes in the degradation process under the stress 
conditions close to the SOA limit, which was also reported for conventional 
MM stress conditions in [Van06]. However, the deterministic solver provides 
the possibility to comprehensively describe the microscopic effects of hot 
carriers and accurately obtain the EDFs up to high energies in a realistic 2-D 
device structure and develop a practical physics-based degradation model to 
evaluate the resulting excess base current. 
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Although cold carriers can also participate in the MVE of the Si—H bonds, 
hot carriers with energies greater than 1.5 eV have a much higher chance 
to break the Si-H bonds directly [Tyal6]. Moreover, the activation energy 
parameters determine the power-law time dependence and the dependence 
of the excess base current on the stress conditions. Hence, (Ea) = 1.6 eV 
and og = 0.2 eV, which are in good agreement with those experimentally 
obtained [Ste96, Pob13], were considered for the activation energies of the 
bond-breakage to achieve good agreement with measurement data. 

The AB Als along the EB spacer oxide interface obtained from this 
calibration are depicted in Figure 2.38 (bottom). As an expected result, 
the equilibrium electron EDFs at P1 and P2 result in zero AB Als along the 
oxide interface. Furthermore, due to the low-energy humps in the EDFs of 
electrons at P3 and holes at P1, their corresponding Als are very small. In 
consequence, the measured base current degradations under P2 and P3 have 
to be ascribed to hot holes with relatively high AB rates, in which the bigger 
AI at P3 compared to P2 explicitly explains the bigger degradation current 
under this stress condition. 

Figure 2.39 represents the trap densities along the EB spacer oxide 
interface for several stress time steps, which are significantly generated by 
the AB process due to hot holes. These interface trap densities, calculated 
for No = 10!? cm~, reveal that the large variation of the AB Als along the 
oxide interface results in a strong non-uniformity in the spatial distribution of 
the Ni: profile. 
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Figure 2.39 Interface trap densities generated at different stress time steps from node A to 
C denoted in Figure 2.36 at P3. 
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The generated traps at the EB spacer oxide interface cause a non-ideal 
increase in the forward mode base current via field-enhanced SRH recombi- 
nation [Hur92]. Figure 2.14 (top) shows the Gummel characteristics of the 
fresh device and the degraded device after 1,000 h stress application at P3. 
The leakage currents observed for the fresh device due to packaging [Jac15] 
have no impact in the degradation analysis and are not taken into account. 
Since the recombination process has no considerable impact on the collector 
current, the resulting increase in the base current degrades the current gain of 
the transistor. 
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Figure 2.40 (Top) Gummel characteristics (Vcs = 0 V) of the fresh and degraded SiGe HBT 
after 1,000 h at P3 obtained from simulation (lines) and measurement (symbols). (Bottom) 
Excess base currents over the stress time obtained from simulation (lines) and measurement 
(symbols) at Vgs = 0.67 V and Vop = 0 V. 
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In order to assess the time dependence of the HCD effects, the excess 
base current ATg = Ip (t) — Ig (0) is extracted over stress time [Figure 2.14 
(bottom)]. Very good agreement between the simulation results and measure- 
ment data proves that the EDF-based degradation model can directly explain 
the time dynamics of the HCD results together with their dependence on the 
stress bias conditions. 
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Abstract 


Fabrication and circuit design are linked by compact device modeling; i.e., 
the electrical characteristics of the devices fabricated on a wafer are rep- 
resented by sufficiently simple but preferably still physics-based models 
that are suitable for circuit simulation and optimization. The importance 
of modeling has been growing rapidly due to strongly increased device 
complexity, manufacturing cost, and fabrication time. There is an increased 
demand from industry for first-pass success of high-frequency (HF) ana- 
log circuits in order to stay competitive. For SiGeC HBT technologies, 
ranging from production to the most advanced process, this has been 
successfully addressed by the standard compact bipolar transistor model 
HICUM/L2 [Schr10]. 

For practical applications, a compact model (CM) itself is not sufficient 
though. Its model parameters need to be determined from measurements of 
terminal (current, voltage) characteristics, preferably making use of clever 
test structures and mathematical manipulations (so-called parameter extrac- 
tion methods) in order to be able to separate the various, often superim- 
posed, physical effects and their related parameters. Consistent physics-based 
parameter extraction methods that provide for a given process accurate 
geometry-scalable and statistical device models not only represent a key 
enabler to first-pass design success but also yield important information for 
process development. One objective of DOTSEVEN was the development of 
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improved or even new parameter extraction methods and to provide a unified 
set of test structures. 


3.1 Introduction 


The predicted THz performance of SiGeC HBTs along with their integration 
with digital CMOS indicates a bright future of the corresponding BiCMOS 
technologies for serving HF applications [Schlla, Sch11b, Sch16a]. These 
predictions and the rising demand for mm- and sub-mm-wave! applications 
have motivated the development of improved SiGeC process technologies, 
leading to the most recent results [Heil6] of (fT, fmax) = (505, 720) GHz 
fabricated with a 130 nm lithography within the DOTSEVEN project. 

The large variety of HF applications requires a versatile and accurate 
representation of such process technologies within the design kits in order 
to enable circuit optimization and exploiting the performance limits of the 
technology. Thus, an important focus of DOTSEVEN was the development of 
suitable simulation and modeling tools as well as the verification of the new 
models [Schl6c]. The corresponding effort on compact modeling of high- 
speed SiGeC HBTs and the associated experimental results are presented in 
this chapter. 

Compact models — which are also sometimes called electrical models 
or SPICE models — were introduced in the 1970s as a constitutive and 
inseparable part of simulators for electronic circuits. Based on a set of 
parameterized analytical equations, CMs are meant to provide a suitable rep- 
resentation of the electrical characteristics of electronic devices under given 
bias, frequency, and temperature conditions. The large variety of applications 
in the Si industry has led to a strong preference for physics-based CMs, in 
which (i) the formulations for current and charge are derived as simplified 
solutions of fundamental equations for carrier transport and electrostatics, 
(ii) most of the parameters retain a physical meaning, and (iii) the equivalent 
circuit corresponds to the physical structure of the device. Such physics-based 
models enable not only device sizing-based circuit optimization but also 
efficient modeling of statistical process variations and process debugging. 

The determination of the model parameters from device measurements, 
typically called parameter extraction, includes the specification of measure- 
ment conditions and the mathematical procedure for data manipulation for 
obtaining the desired parameter values. This task is sometimes, especially in 
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the II-V community, called model extraction. This incorrectly implies the 
“construction” of the CM, which is actually not included. Similarly, often 
just the term model is used referring to both the CM (formulation) and its 
associated parameter set. Since for the vast majority of application cases the 
CM is given, the usually tedious and lengthy task of its development should 
be distinguished from the task of extracting its parameters. 

While CMs are developed independently of circuit simulators, the avail- 
ability of high-level description languages (such as Verilog-A, cf. e.g., 
[Muk16]) and associated model compilers has significantly sped up not only 
the model implementation but also its release for all commercial simulators 
in the form of a single (reference) model code. 

The various tasks mentioned above eventually lead to a working CM. 
When evaluating its accuracy, different aspects come into play, which are 
sometimes a source of confusion. The necessary conditions for obtaining an 
accurate CM are listed below: 


1. The intrinsic ability of a CM to describe a device, i.e., the versatility and 
accuracy of its constitutive equations and its equivalent circuit. 

2. The model coding by its developers (e.g., in Verilog-A), e.g., correctly 
modeling capacitances by their respective charges, following from the 
integration of the capacitance between equilibrium and the controlling 
voltage(s). 

3. A correct model implementation by the EDA vendors in their circuit 
simulator. Even when using appropriate tools, like model compilers, this 
step induces many non-trivial optimizations and customizations, which 
can be a source of error. 

4. The accuracy of the model parameters, i.e., how accurate the individual 
electrical characteristics of the model agree with those of the measured 
device to be represented. 


Only meeting all of the above criteria with sufficient quality will provide 
the desired CM accuracy for a given technology or process. No matter how 
accurate and versatile the CM itself may be, poorly determined parameters 
will ruin the effort spent not only on its development but also for process 
development and circuit design. 

Compact modeling of high-speed SiGeC HBTs within DOTSEVEN 
addressed two main aspects. First, libraries with geometry scalable model 
parameters for the HBTs fabricated in the project were provided, enabling 
the design of mm-wave circuit building blocks and demonstrators. Second, 
CMs in combination with TCAD were employed for analyzing the process 
in terms of causes of performance reduction by separating 3D parasitic 
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effects from 1D transport effects in the intrinsic device structure [Kor15]. The 
compact HBT modeling in the project was based on HICUM Level 2 (L2), 
which has been a CMC supported standard since 2004. Compared to other 
existing CMs, HICUM/L2 has been continuously developed for HF/high- 
speed HBT technologies and applications and offers various HF-specific 
features as well as high accuracy up to higher frequencies and over a wider 
bias range. This chapter highlights recent model extensions that are rele- 
vant for DOTSEVEN-like technologies. Furthermore, important steps during 
parameter extraction are presented with an emphasis on the model extensions 
and physics-based geometry scaling. In addition, most recent evaluations of 
methods for determining the series resistances are briefly discussed. 

Section 3.7 gives an overview about publications showing comparisons 
of DC, AC and non-linear large-signal characteristics with experimental data. 


3.2 Overview of HICUM Level 2 


A detailed derivation of the formulations of HICUM/L2 can be found in 
[Sch10] and is beyond the scope of this book. Below, just a brief overview of 
the model components is given to provide a reference for further discussions. 

The equivalent circuit of HICUM/L2 is displayed in Figure 3.1. The 
intrinsic (1D) transistor behavior is described by the controlled current source 
for the transfer current Iry, that is calculated based on the generalized integral 
charge-control relation (GICCR, cf. Section 3.3), the dynamic currents result- 
ing from the time-dependent depletion charges (Qjmi, Qjci) and diffusion 
charges (Qari, Qaci), and diodes for the currents injected into the emitter 
(pri) and collector (Ijgci). Collector impact ionization is represented by 
the current avı. As proved in [Sch16b], the complicated behavior of the 
(ohmic) intrinsic collector (epi) region can be accurately described within the 
GICCR framework and does not require a separate element with a typically 
complicated description as it is the case in some other CMs. 

Laterally distributed effects in the internal base region are modeled by 
the bias-dependent internal base resistance Rp, which takes conductivity 
modulation and emitter current crowding into account. Dynamic emitter 
current crowding during small-signal operation is modeled by the parallel 
capacitance Cppi. 

The injection across the emitter perimeter junction is represented by 
QjEp and Ijgep. The current source IBET(i,p) models band-to-band (BTB) 
tunneling and can be connected either to the internal (B’) or to the perimeter 
(B*) base node, depending on the transistor architecture. 
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Figure 3.1 Equivalent circuit of HICUM/L2 including the adjunct networks for modeling 
electro-thermal effects and NQS effects. Not shown are the networks for modeling corre- 
lated noise [Her12, Sak15, Sch10]. The dash-dotted line defines the intrinsic (1D) transistor 
representation and the dashed line defines the internal transistor. 


The external BC region is modeled by the junction current Ijgcx and 
the dynamic currents through the time-dependent charges Qjcx and Qjcx”. 
The latter include the depletion charge in the external base as well as the 
oxide capacitance of the shallow trench and contact region related parasitic 
capacitance between the base and the collector. The charges are split across 
the external base resistance to account for distributed lateral effects at high 
frequencies. The emitter contact and poly-silicon resistance are represented 
by Rp, while the external collector resistance Rcx includes the collector 
contact, sinker, and buried layer contributions. The capacitances CBE, pari 
and CBE, par2 include the BE spacer and parasitic poly-silicon contact region 
related capacitances between the base and the emitter. 

The substrate transistor is modeled with simple expressions for the trans- 
fer current source Irs and the respective back-injection current Ijgc. The SC 
depletion charge is modeled by Qjs, and a simple bias-independent storage 
time is used for describing the diffusion charge Qas. 
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Substrate coupling effects are described by a simple first-order frequency 
dependence with Rsu representing the finite resistance of the path between 
sub-collector and substrate contact and Cg, caused by the permittivity of the 
bulk substrate. An improvement of this simple equivalent circuit is presented 
in Section 3.5. Note that, based on the final circuit layout, any elaborate 
equivalent circuit can be connected to the substrate node. 

Electro-thermal effects are taken into account by a simple first-order low- 
pass network consisting of the thermal resistance Rip and thermal capacitance 
Cin. The externally available temperature node allows the connection of both 
higher order and thermal coupling networks if required by the application 
[Leh14]. 

The extraordinary performance of SiGeC HBTs in DOTSEVEN has been 
achieved with changes in the transistor architecture. Changes in lateral direc- 
tions result in a modification of both geometry scaling laws and the equivalent 
circuit. The most relevant aspects here will be discussed in Sections 3.5 
and 3.6.3. Structural changes in the vertical direction impact the intrinsic 
transistor behavior. They have been accounted for in the formulations for 
the transfer current and stored charge, which will be discussed in the next 
two chapters. In versions of HICUM/L2 previous to 2.3, those effects were 
not considered explicitly, which led to increased effort for parameter extrac- 
tion using conventional methods, potentially yielding non-physical model 
parameters. 


3.3 Modeling of the Quasi-Static Transfer Current 
3.3.1 Basics of the GICCR 


As shown in [Sch10, Sch16b], the GICCR can be derived as closed-form 
solution from integrating the 3D drift-diffusion transport equation. This sec- 
tion summarizes the relevant features of the GICCR by focusing on just the 
vertical (1D) npn transistor structure. In this case, the 1D GICCR master 


equation reads 
(qAg)?VrinBripe Vg’ Vac \| _ 
exp Pi, = Irt — Irr, 
T 
(3.1) 


Qph Vr 
with the elementary charge q, the emitter area Ag, the thermal voltage V7, the 
electron mobility un, the intrinsic carrier density n;, and the voltages between 
the terminals of the 1D transistor Vg’ and Vpc’. The denominator results 


Ip = 
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To 
Qoa = Ana | ha)ple)de (3.2) 
Th 
from integrating the transport equation from the mono- to poly-silicon emitter 
interface, which defines the 1D emitter contact xg’, to the peak of the buried 
layer, which defines the 1D collector contact xœ - Qpp is a weighted hole 


charge, with p being the hole carrier density and the weight function h(x) 


h(x) = hg(x)hj(x)hy (x). (3.3) 
Its first component, 
— BPnBNip2 
h(x) = EET (3.4) 


accounts for all effects related to the field-dependent electron mobility and 
the spatially varying bandgap. The normalization factor »p77;p is taken as 
average value over the neutral base. The second component, 
AgJ, 
hla) = -FEA (3.5) 
T 

is, in the 1D case, related to volume recombination, while in the 3D case it 
also accounts for the lateral spreading of electron current density, typically in 
the collector. The third term 


hy (a2) = exp (Beet) (3.6) 


takes into account the spatial variation of the hole Fermi-potential p w.r.t. the 
chosen controlling (node) voltage. The deviation is expected to be negligible 
across the vertical neutral base region, but can become significant in the 3D 
case for emitter current crowding. 

The spatial dependence of the components (3.4) and (3.5) in Figure 3.2(a) 
shows that hj and hy are close to 1 in those regions where the hole density 
matters. 

In contrast, the weight function hg shown in Figure 3.2(b) follows mostly 
the spatial dependence of the bandgap, except for the BC depletion region, 
where both mobility and hole density are much lower than in the other 
transistor regions. As a consequence, for the ip formulation of the intrinsic 
transistor only the impact of hg and, in particular, of the bandgap needs to 
be taken into account. The influence of hj and hy can be included in the 
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Figure 3.2 Spatial dependence of the weight functions (a) Ay and hy for Jc = 5mA/um?, 
and (b) hg (solid line) for low injection. In both pictures the dotted line shows the 1D doping 
profile in log-scale. In (b), the dashed line shows the bandgap in linear scale. 


3D formulation through the weighted charge formulation (collector current 
spreading) and the internal base resistance (emitter current crowding). 

For compact modeling, it is useful to split Qpn into several components 
according to the device structure and transistor operation principle. The most 
suitable split leads to the sum of a zero-bias hole charge Qpo and an excess 
charge AQ pn given by the change Ap of the hole density with non-zero bias: 


Zo 


AQph = af h(x) Ap(x)dx = hjriQjri + hyciQici + Qm. 87 


E’ 


Here, Qjri and Qjr; are the physical depletion charges of both junctions with 
the corresponding weight factors hj; and hjci, and Qp is the weighted mo- 
bile charge in the transistor. The normalization factor [{({,;B7{p2 in hg and (3.1) 
is chosen such that the weight factor for Q,9 becomes 1. The other weight 
factors in (3.7) are defined as average values in the corresponding transistor 
region k: 
J, h(w)Ap(a)da q fy h(a) Ap(a)dx 
Jy Ap(a) dx Qk l 
The weighted mobile hole charge in (3.7) is further divided according to the 
different transistor regions: 


hp = 


(3.8) 


Qm = hotolre + he AQ + AQme + hQ + Trlr- (3.9) 


Here, Ips and Iry are the forward and reverse transfer currents, respectively, 
as defined in (3.1), with ty, and t, being the corresponding (low current) 
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transit times. AQ, AQ sp, and Qrc are the physically stored excess charges 
in the emitter, base and collector regions at medium and high current densities 
at forward operation, while fo, hfg, and hpc are the corresponding weight 
factors according to (3.8). The weight factor for the base charge is left 
close to 1 since the value of hg is close to 1 (cf. Figure 3.2(b)) due to the 
choice of MnBnipgz2. To keep the model simple. Also no dedicated weight 
factor is used for the reverse charge. Note that although Q,), is related to 
the actual hole charge, it does not have a physical representation by itself in 
the transistor. Thus, in contrast to the actual charge, it cannot be measured 
directly. 

In summary, the GICCR in the form of (3.1) represents a physically 
consistent closed-form description for the transfer current of HBTs, which 
also provides clear guidance on how additional effects need to be included. 
The following sections describe the extensions of the GICCR presented 
above with respect to the advanced SiGeC HBT technology developed in 
DOTSEVEN and related projects (DOTFIVE, RF2THz). 


3.3.2 SiGe HBT Extensions 


Within DOTFIVE and DOTSEVEN very different approaches toward SiGe 
HBT technology development were taken. When investigating the fabri- 
cated HBTs, significant differences in the ideality of the transfer current 
characteristic and in the current density dependence of the corresponding 
transconductance had been observed. An example of this behavior is shown 
in Figure 3.3(a). The observed differences were eventually traced to back 
to different Ge profiles in these technologies, in particular within the BE 
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Figure 3.3 (a) Transfer current for transistors with different Ge profiles in the base. (b) 
Transfer current normalized to their ideal formulation for the same transistors as in (a) at 
room temperature and Vg/cœ = 0 V. 
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space-charge region (SCR), cf. [Paw14a]. While spatially constant Ge profiles 
show the expected almost ideal behavior, grading the Ge already within the 
BESCR led to a significant deterioration in ideality and the transconductance 
already at medium collector current densities. 

Normalizing the transfer current to its ideal form, Is exp(VepE/Vr), 
magnifies the above mentioned non-ideality as shown in Figure 3.3(b). 
This increase in non-ideality was first noted in [Cra93] and later mod- 
eled in [Paa01] by modifying the Gummel number. In this section, the 
origin of the effect will be briefly reviewed and then its incorpora- 
tion into the transfer current expression by applying the GICCR will be 
demonstrated. 

A graded Ge profile leads to a spatially dependent bandgap in the 
base region which in turn leads to a spatially dependent intrinsic carrier 


density 
A 
ni (x) = nio exp ( we) (3.10) 


with nio representing pure Si and AV,(> 0) as the bandgap reduction due 
to the Ge mole fraction. Since the bandgap decreased with increasing Ge 
content, the intrinsic carrier density increases with x. 

According to the classical theory for bipolar transistors, the electron 
density ne injected at the beginning xe of the neutral base (cf. Figure 3.4) 


is given by: j 
` Vy An 
pe niee exp ( a ) , (3.11) 


With increasing forward voltage Vp’p across the BE SCR, xe moves towards 
the junction and thus smaller values of n;. This in turn reduces the injected 
electron density compared to the ideal case of a spatially independent n; and 
leads to the non-ideality observed in Figure 3.4.7 

The varying n; enters the GICCR via the weight function (3.4), which 
needs to be inserted into (3.7) in order to calculate the weight factor hjp; 
for Qjni. In the following derivation, a spatially constant base doping profile 
is assumed for tjp < v < zeo. Also, for simplifying the expressions, 
a coordinate transformation is applied such that xj_ is chosen as new 
reference: 


go =2— LjE- (3.12) 


Note that from Figure 3.4 the depletion charge follows as Qjzi = qNp(aeo — Xe). 
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Figure 3.4 Visualization of the depletion charge in the base-emitter space charge region for 
two bias points with increasing voltage from the top picture to the bottom. The intrinsic carrier 
density is given in log-scale. xjm relates to the metallurgic BE junction, while xe and xeo, 
respectively, are the boundaries of the space charge region for Vp-z > OV and Vpg’e = OV, 
respectively. 


Assuming a fully depleted SCR, i.e., p = 0 for xjm < £ < Xe, and p = Np in 
Le < £ < Teo, the weight factor is given by: 


Inserting (3.4) with (3.10) and assuming a spatially independent electron 
mobility yields: 


2, - AV, A AV, 
banin (* exp (S44) d, cime J exp (240) de 


hec Unni jE /Xe0 T 
= Te — Log Le — Leg 
(3.14) 
with nj j = ni(xje). The spatially linear bandgap variation is given by: 
AY, z / / 
AV, (x) = —2me" ==, (3.15) 


VIT’ Vg, max Qni 
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with the maximum bandgap change AV, max and the location of this 
maximum, Tyg max: Inserting this into (3.14) leads to 


exp (-4) — exp ( a (3.16) 


i a Bot 
Te Leg 


hjEi = —@niChBE 


Following classical theory, the width of the SCR is 


Vp, 1 
x! = 2'94/1— Te (3.17) 
1 


T Vpg'g 

za) 7E G (: : Vori )) ; 3 18) 
, Vara l l 
sa (1-1) 


A similar expression can also be obtained for an exponential dependence of 
the base doping profile within xjẹ < £ < zeo. 

Replacing the square root by the grading coefficient zgi, which allows to 
capture realistic base doping profile shapes, yields the final model equation 


Vang \ Ei 
exp (am (1- (1) -1 
hjgi = hjkio F = (8.19) 
jEi Vor: 


DEi 


which finally leads to 


hjEi = ChBE eXP ( 


where various quantities in (3.18) have been merged into the model parame- 
ters anjri = Xho/ani and hjzio = ChBE exP(—anjri). 

The application of the model is shown in Figure 3.5 for transistors with 
different Ge shapes. A weaker grading means a smaller value of AV; max in 
(3.15) leading to a smaller increase of the weight factor. For all cases, the 
model equation yields very accurate results. For the box profile, no grading 
is present, leading to ajg; = 0. In the model, a series expansion of (3.19) is 
used for small aņpjgi, which leads to hjgi = hjgio for apjgi = 0. Furthermore, 
the singularities at Vg' = OV and at Vergy = Vpr; are avoided in the 
model formulation. 

For a Ge profile increase throughout the entire base region, i.e., including 
the base portion of the BC SCR, also the weight factor hjc; becomes bias 
dependent but decreases with increasing voltage Vgc. The behavior of hjci 
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Figure 3.5 Application of the model Equation (3.19) for the weight factor obtained from 
transistors with different shapes of the Ge profile [Paw15a]. 


can be described by a similar function as (3.19) and its own set of parameters. 
The value of hjc; decreases with the slope of the Ge grading, which results in 
a reduction of the Early effect. A simple physics-based explanation for this 
is that the Ge grading causes a strong aiding drift field Enx across the base 
region. Once the injected electron current density J, is dominated by drift, 
the position of the BC SCR boundary x, at the end of the neutral base has 
only a weak impact on the value of Jn. In other words, in a box Ge profile 
(1.e., Enx ~ 0) Jy is driven the diffusion gradient of the injected carriers. This 
is visualized in Figure 3.6 by the curve ¢ ~ 0, where 


Enx 
Vr/wg 


represents the field factor and the normalization factor Vr /wp corresponds 
to the equivalent field of a pure diffusion current. For a high field, the 
carrier gradient disappears as shown in Figure 3.6 for different values of 
¢ > 0. Hence, moving the location of xe does not change Jn anymore, thus 
resulting in the observed larger Early voltage for graded base HBTs. Since 
the bias dependence of the corresponding weight factor hjc; is very small, no 
dedicated model equation for hjci(V grc) is included in HICUM/L2. 

Except for the non-idealities in the transfer current at low injection that 
were explained before, also the increasing non-linearity of SiGe HBTs with 
graded Ge compared to transistors with a box Ge profile can be seen even 
before the onset of high-current effects. An explanation for this phenomenon 


ae (3.20) 
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Figure 3.6 Spatial dependence of the electron density normalized to ne from (3.11) in the 
neutral base, marked by the vertical dashed lines, for different values of the field factor ¢. 
The x-axis is normalized to the metallurgical base width wgm with the BE junction located at 
z=0. 


can be found by a closer look at the different parts of the mobile charges and 
the relation to the corresponding weight factors according to the GICCR. 

The reason for the increased non-ideality is also indicated by the voltage 
drop from the internal emitter contact F’ at x = 0 to £ = £e. The voltage 
drop is given in Figure 3.7 for transistors with different Ge profiles. The corre- 
sponding hole densities are given in Figure 3.8(a). It can be seen that although 
the voltage drop in the emitter is the same for all transistors up to medium 
current densities, the voltage drop in the BE SCR for a given current density is 
significantly larger for the transistor with the graded Germanium profile. This 
is caused by the neutral charge in the SCR. As shown in Figure 3.8(b), for a 
given collector current density this charge is visibly smaller for the graded 
Ge compared to the box Ge. Since however the current density is defined by 
the electron density and the gradient of the quasi-Fermi potential, the reduced 
carrier density can only be compensated for by an increased voltage drop for 
maintaining the current density. 

For modeling though, only the potential at the E’ is known but the voltage 
drop not calculated explicitly. However, it was shown in, e.g., [Paw15a, Fri02] 
that a voltage drop in the internal transistor can be directly expressed by the 
weight function in the GICCR. The resulting weight factors for the mobile 
charge in the emitter, 
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Figure 3.7 (a) Determination of the voltage drop in the neutral emitter (AV) and in the 
BE SCR (AV gg) from the quasi-fermi potential of the electrons for transistors with different 
Ge profiles in the base. The dashed lines show the begin and end of the BE SCR. (b) Current 
dependence of the voltage drops; AV gx is shown for bias points only up to the beginning of 
high-current effects [Paw 15a]. 
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Figure 3.8 (a) Spatial dependence of the hole density for transistors with different Ge 
profiles. The Ge content is given by the dotted lines for the box (left) and the graded (right) 
profile. The vertical dashed lines are the same as in Figure 3.7(b) Minority charge as a function 
of collector current density in the BE SCR for transistors with different Ge profile [Paw 15a]. 
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Figure 3.9 Current dependence of the weight factors for the charge stored in the neutral 
emitter and BE SCR for the transistors with different Ge profiles [Paw15a]. 


respectively, show just a weak current dependence (see Figure 3.9). While 
hep, equals 1 for BJTs, it is much lager than 1 for HBTs due to the reduced 
band-gap in the base of the latter compared to that in the Si emitter, i.e., 
NiB,SiGe > Nip,si, but Nin sice = NiE,Si- 

The weight factor hypr is larger than 1 even for the BJT and box Ge 
profile due to the bandgap variation across the BE SCR caused by the doping 
profile (i.e., the bandgap narrowing effect). A Ge grading across the BE SCR 
can increase the value of hmpr significantly as shown in Figure 3.9. 

The low-current component of the mobile charge, Qa = Tolre, is exper- 
imentally characterized by a low-current transit time 779, which is difficult 
to partition into its components from the different spatial transistor regions. 
Therefore, in HICUM/L2 the mobile charges in emitter, base, and BC SCR 
are merged into the low-current minority charge Qr [cf. (3.9)]. For describing 
the transfer current, a weighted charge hfo Qg has to be inserted according to 
(3.9). For advanced SiGe HBTs, a weight factor hg > 1 turned out to be 
necessary for modeling the transfer characteristics at medium injection levels 
while maintaining a physics-based value of Qpo. 

Two components of Qa, the one related to the neutral base and to the BC 
SCR exhibit a dependence on V pyc’. In addition, the Ge grading also causes 
the weight factor of the neutral base charge to be Vpcv dependent. Since the 
Veprcr dependence is a strong function of the actual transistor design, i.e., the 
doping concentrations in the base and collector, only a very general model 
equation according to 


hg = hgoo(1 + anto,cV Bo") (3.23) 
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Figure 3.10 Bias dependence of hfo extracted from 1D device simulations and application 
of the model equation (3.23). 


is employed. The bias dependence of hp based on 1D device simulations 
is given in Figure 3.10. The extracted values from measured transistors are 
shown later in Figure 3.28. 

The weight factors hfg and hpc for the high-current components AQsr 
and Qsc in (3.9) were already available in HICUM/L2 prior to version 2.3. 
For the emitter, (3.21) is applied with a bias-independent value according to 
Figure 3.9. The weight factor for the collector, 


(3.24) 


is only relevant at high injection since under forward operation the hole 
carrier density and related charge in the collector are negligible at low and 
medium injection. 


3.3.3 Temperature Dependence 


As outlined in the introduction of this chapter, the transfer current exhibits 
unique temperature effects for graded Germanium transistors compared to 
box transistors (or BJTs). In addition to the temperature dependence of the 
charge components, the weight factors play a significant role for correctly 
describing the temperature dependence of the transfer current. The temper- 
ature dependence of the weight factors can be derived systematically from 
(3.4) and is mostly related to the ratio of the intrinsic carrier densities in 


130 SiGe HBT Compact Modeling 


the particular transistor region k via the region-specific bandgap. The general 
equation for the weight function temperature dependence therefore reads 


hy(T) = Mnp(T)n2,(T) nk (Toni (To) ys ni, (T) na (To) 


wm 


he(To) Mak (T) ni (L) unanong (To) n} (To) ng (T) 


neglecting the temperature dependence of the mobility. Assuming a linear 
temperature dependence of the spatially averaged bandgap voltage Vz, in a 
particular transistor region k yields for the corresponding ratio 


B-P Geol) 
n2(T>) AD | Vr AT PL Ve AT 
(3.26) 


with Vako as the toward T' = 0 extrapolated bandgap voltage. This gives for 
the corresponding weight factor 


hk(T) _ Vaso — Vexo ( T 
hig (To) = exp ( ( 1)) š (3.27) 


Therefore, if the bandgap of region k is larger than that of the base region, the 
value of hy will decrease with increasing temperature. Except for hjc;, this is 
indeed the case for all weight factors in HBTs with a graded Ge profile across 
the entire base. 

Following from (3.27), for hj; the temperature dependence of hjgio is 
given by the term 


AVBE 
Vr 


hjgio(T) = hjgio(To) exp ( (7 = 1)) exp(anjmi(Zo) —anjei(T)), 


To 
(3.28) 
which has been changed into the more flexible model expression 


T CVgBE 
ie 


AVBE = VeBo = VejE0 <0 (3.30) 


and the exponent coefficient Cvgpr as model parameters. In addition to hjgio, 
the parameter dpjpi is also temperature dependent due to Xeo(T) and Vr in 


AVBE 
Vr 


hirio(T) = hjrio(Zo) exp ( 


with 
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(3.18) and (3.15). The former can be directly expressed by the corresponding 
depletion capacitance Cjgio(T), the latter directly by T, yielding 


To Cifio (To) T Shjpi 
kilT) = anni (o) = = ț anilo) o, 3.31 
anji (T) = anje: (To) T Colf) anjri( oT (3.31) 
where introducing the exponent coefficient Çnjgi as model parameter provides 
more flexibility. 
Since a large portion of hto is related to hmpgg which itself depends on 
V sjko as well, a similar temperature dependence for Afo is derived reading 


høo(T) = hgo(To) exp (A (= = 1)) (3.32) 
T 0 
The aforementioned model equations are compared with device simulation 
results in Figure 3.11, showing both the expected decrease of the weight 
factors and sufficiently high accuracy. 
The temperature dependence of the high-current weight factors follows 
directly from (3.23) and (3.21). 


Veo — Va(pcyo ( T 
h,o (T) = hge,c) (To) exp | i a i (= | (3.33) 


Figure 3.12 shows the corresponding comparison with device simulation 
results. 
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Figure 3.11 Application of the model Equations (3.29) and (3.32) to hjrio(T) and heo(T) as 
well as Equation (3.31) to anjgi obtained from 1D device simulations [Paw1 5a]. 
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Figure 3.12 Application of (3.33) to the high current weight factors hfe and hfc obtained 
from 1D device simulations [Paw 15a]. 


3.4 Charge Storage 
3.4.1 Critical Current 


Based on the Kirk-effect, the critical current [cx characterizes the onset of 
high-current effects. It is modeled in HICUM/L2 by the equation 


Voi 2 
Ree C — |14 U + y V° + aICKpt (3.34) 
by Sa 2 
Vai \ s oE 
Rcio ( + (v=) ) 
with v = (Voi — Viim)/Vpr as argument, 
Voi = Vær — Von's (3.35) 


as effective collector voltage and Vp, as the internal CE saturation voltage, 
and Viim corresponding to the electric field separating the ohmic from the 
saturation region in the velocity versus field relation. 

For low values of Vc; the entire epi-collector region becomes neutral 
at the onset of high current densities. This is represented in (3.34) by the 
first term, where the Vci-dependent term in the denominator models the 
field dependent mobility. The low-field mobility has been absorbed in the 
model parameter Roig, which resembles the ohmic resistance of the entire epi- 
collector region with an average doping concentration Nc. The parameter 
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dck Was set to 2 in versions of HICUM/L2 prior to 2.3, but has recently been 
introduced as model parameter in order to allow a more flexible voltage- 
dependent (i.e., collector field) description of Jc, (e.g., da = 1 for pnp 
transistors). 

The last term within the square brackets in (3.34) represents the high- 
voltage solution which is characterized by the collector punch-through 
voltage 


Ver = way. (3.36) 


While the collector doping has been continuously increasing for high-speed 
HBTs, the collector width wc; has decreased, yielding smaller and smaller 
values for Vpr. Furthermore, ajcKpt was recently introduced as a model 
parameter (rather than a fixed parameter) for the hyperbolic smoothing func- 
tion that connects the low- and high-voltage regions in order to provide 
a highly accurate modeling of the (quasi-)saturation region in the output 
characteristics and to avoid possible kinks due to non-physical parameter 
values [Cel14]. 

The effect of ex on the voltage dependence of cK is shown in Fig- 
ure 3.13(a). The lower ck reduces the slope at lower voltages (i.e., field 
in the collector), but the asymptotic value for cK will be the same at very 
large voltages since the saturation velocity is not changed by the parameter. 
Figure 3.13(b) exhibits the impact of ajcxpt. The possible kink in Icg for too 
small values of ajcKpt can be clearly observed. 
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Figure 3.13 (a) Impact of ôcx on the voltage dependence of Jcx. Solid lines show the actual 
Icx while dashed lines show the results for the low-voltage portion of [i.e., the first term 
of (3.34), neglecting punch-through]. (b) Impact of arCkpt on Ick for a very small punch- 
through voltage. 
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3.4.2 SiGe Heterojunction Barrier 


The changing composition from SiGe back to Si within the BC SCR causes 
a barrier in the valence band. Due to the very small difference in electron 
affinity between Si and SiGe (for typical Ge concentrations not exceeding 
30%), this barrier is almost completely given by the difference in bandgap 
between the Ge (peak) location and the pure Si collector region. For low 
current densities and CE voltages beyond strong saturation, the barrier is 
typically masked by the electric field in the BC SCR. However, at high 
current densities the electric field in the BC SCR starts collapsing due to the 
compensation of Nc; by a high mobile carrier concentration. Since the BC 
barrier initially prevents holes from being injected into the collector (unlike in 
a BJT) not only the increase in electron current density with Vp-p becomes 
limited but also a dipole layer starts to form around the barrier [Sch10]. 
This leads to a shift of the barrier height from the valence band into the 
conduction band and hole injection from the base into the collector to enable 
a further increase in current density via additional diffusion. The formation 
of the barrier is highlighted in Figure 3.14 where also the barrier height in the 
conduction band, AVc par, is defined for the case of high current densities. 

Resulting from observing its current dependence, the barrier height in 
HICUM/L2 is modeled by a simple empirical expression, 
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AVC bar = Vo,bar exp (3.37) 
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Figure 3.14 Spatial dependence of the conductance band edge for low and high current 
densities (solid lines), highlighting the presence of the barrier at high current densities. The 
dashed line shows the doping profile of the transistor just for reference. 
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with the normalized current 


ibar = frn lox (3.38) 
C,bar 

The model is based on the observation that, independent of the collector 
voltage, the barrier shows almost the same current dependence starting from 
an onset current. For the latter, cx is used since usually this onset current 
correlates with the classical Kirk-effect as long as the barrier is located not 
too far away from the metallurgic junction in the collector. Figure 3.15 shows 
the current dependence of AV Cbar obtained from 1D device simulation in 

comparison with (3.37) for a wide range of (internal) CE voltages. 
The impact of the BC barrier on the mobile charge and associated transit 
time is split into two distinct components [Sch10]. First, the charge storage 
contribution in the base region caused by the BC barrier only is calculated by 


AVC bar 
AQztb = Tats! TE lexp (Ao) = 7 ; (3.39) 


Second, the classical high-injection charge in the collector and its associ- 
ated base component (triggered by the collector related high-current effect) 
[Sch99] is extended by a barrier term, 


(3.40) 


AQm,c = Thcs Trew’ exp ( 


AVo bar = Vo,bar 
Vr , 


o extraction from 1D 


Figure 3.15 Modeling of the current dependence of the heterojunction barrier voltage for 
different voltages Vcg /V = [0.3; 0.6; 0.9; 1.2; 1.5]. 
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Figure 3.16 Transit times of a SiGe HBT showing the BC barrier effect: comparison of 
Equations (3.39) and (3.40) with results from 1D device simulation for different voltages 
Væg /V = (0.3; 0.5; 0.8; 1.0; 1.2; 1.5]. 


which causes a delayed nonlinear increase until the barrier is built up in the 
conduction band. Figure 3.16 shows the corresponding transit times obtained 
from the charge formulations above as 


dQ 


= — Al 


T 
Vog’ 


and compared to 1D device simulation results. The different components 
according to (3.39) and (3.40) are drawn separately as well as their sum which 
yields the small-signal transit time seen at the terminal of the 1D transistor. 


3.5 Intra-Device Substrate Coupling 


At high frequencies, the signal coupling between the collector of a transistor 
and its surrounding substrate can strongly affect the small-signal behavior, 
especially the HF output impedance. Different signal paths have to be distin- 
guished. Since the collector (i.e., the buried layer) of a Si-based HBT usually 
forms a pn junction with the substrate, a signal path to the bulk substrate 
contacts exists across the area component of the CS junction capacitance Cjsa 
in Figure 3.17. In addition, a signal path through the perimeter junction and 
shallow or deep trench exists which is represented by Cjsp, Cst1, and Cpri 
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Figure 3.17 Sketch of the cross section for (a) a junction isolated (with partial trench- 
isolation) and (b) a deep trench isolated collector including all relevant elements of the 
most simple equivalent circuit for modeling intra-device substrate coupling. Note that for all 
series resistance a parallel capacitance exists due to the permittivity of the substrate and that 
due to changes of the substrate-collector SCR all depletion capacitances Cjs(a,p) and series 
resistances Rgy(a,p) are bias dependent [Sch10, Paw15a]. 
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in Figure 3.17. This perimeter path is much shorter than the bulk path if the 
substrate contact surrounds the transistor and is placed as close as possible to 
the collector, which is the case for device characterization in test structures. 
This coupling effect is called intra-device coupling. However, in circuits the 
situation is quite different since typically the surrounding substrate contact 
is omitted and the substrate is contacted somewhere on the chip. In this 
case, not only the transistor output impedance has a different frequency 
dependence but there is also signal coupling though the substrate directly 
between transistors. This effect is called inter-device substrate coupling. Both 
intra- and inter-device coupling are strongly layout dependent. 

It is important to understand for both modelers and circuit designers that 
the CMs delivered in PDKs should be consistent with the p-cells offered to 
circuit designers. If the p-cells do not contain a surrounding substrate contact 
(in contrast to the characterization structures), then the PDK model should 
not include intra-device coupling. In this case, in which the circuit layout 
is unknown to the modeling engineers, it is the responsibility of the circuit 
designer to determine the substrate coupling and cross-talk related impedance 
network for each (critical) transistor! 

In this section, the discussion is limited to intra-device coupling with a 
connected substrate ring. The general impact of the signal coupling on the 
output conductance go = Re{Y22} is visualized in Figure 3.18. It can be 
seen that, for low and medium current densities, substrate coupling leads 
to a strong increase of go already at lower frequencies. This increase is 
proportional to the square of the frequency. 
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Figure 3.18 Impact of intra-device substrate coupling on the dynamic output-conductance 
given by the real part of Y22. Solid lines show the actual values including substrate coupling 
while the dashed lines show results with an ideally open substrate. 


The elements of the equivalent circuit in Figure 3.17 follow directly from 
the transistor structure. The direct path between the collector and the substrate 
may be represented either by the CS perimeter depletion capacitance Cjsp 
for the case of pure junction isolation or by a bias-independent (at least to 
first order) deep trench oxide capacitance Cpr] in case of a deep trench 
isolation. For a combination of a shallow trench and junction isolation, a 
combination of bias-dependent and -independent capacitances is required. 
Depending on the spatial distance between the junction and the substrate 
contact significant series resistances can exist in the various signal paths. For 
each of the bulk related series resistances a parallel capacitance Cs, exists 
due to the permittivity of the substrate. 

All elements given in Figure 3.17 are lumped elements which represent 
highly distributed effects that depend on the transistor dimensions and the 
operating frequency. Therefore, even more complicated equivalent circuits 
than the one in Figure 3.17 may be required to correctly capture the frequency 
behavior of the output impedance at mm- and sub-mm-wave frequencies. 
For accurately capturing the impact of intra- and inter-device coupling, the 
topology of the respective equivalent circuit along with its element values can 
generally only be obtained from analyzing the actual layout of all components 
after designing the circuit. 
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In HICUM/L2 the equivalent circuit shown in Figure 3.19 was chosen 
which is capable of capturing intra-device substrate coupling as it is encoun- 
tered during device characterization and model parameter extraction as well 
as during circuit design if p-cells with surrounding substrate contacts are 
employed. In Figure 3.19, C and S are the terminal collector and substrate 
contact, Rcx is the external collector resistance, Cjg is the bottom component 
of the SC depletion capacitance, and Rg, and Cs, represent the connection 
through the bottom part of substrate. These elements correspond to Cjsa and 
Rsu (in Figure 3.17(b)) or Rgu, (in Figure 3.17(a)), respectively. 

The perimeter substrate capacitance Cscp follows from Csr and Cjsp 
where it is important to realize that in case of a combined trench and 
junction isolation (cf. Figure 3.17(a)) portions of Cjsp may be included in 
Cis depending on the width of the trench oxide and therefore the value of 
Rsu,p- 

The additional series resistance Reont along the side-wall of the trench- 
oxide is not included in the CM in order to reduce the node count for low- 
frequency operation. As demonstrated in Figure 3.19 it can be connected to 
the substrate terminal of the CM in a subcircuit. Similar to the discussion for 
Cscp; the spatial dimensions and relations between the circuit element values 
define whether Rsu,p (if present) is merged into Reont or Rsu. 

In contrast to Cjg which is modeled depending on the internal SC voltage 
Vg, the bias dependence of Cscp can be activated or deactivated by the 
user to include the contributions of the perimeter depletion capacitance and 
the constant trench oxide, respectively. 


Cscp 


Figure 3.19 Substrate coupling equivalent circuit in HICUM/L2. 
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3.6 SiGe HBT Parameter Extraction 


For various reasons, it should not be attempted to determine the parameters of 
a sophisticated physics-based compact HBT model such as HICUM/L2 from 
the characteristics of just a single transistor’. First, the equivalent circuit rep- 
resents still to some extent the distributed device structure; for instance, there 
are internal and perimeter related elements. Also, the transfer current results 
from a well-defined transformation of a two-transistor behavior into a single 
transistor representation (cf. the effective emitter area section later) and, e.g., 
the series resistance values are scaled based on sheet or contact resistivities 
and device dimensions. The information corresponding to those partitionings 
and transformations can simply not be obtained from measurements on a 
single device. Second, a one-hat-fits-all single-device geometry is never being 
used in analog RF circuit design. In fact, exploiting a process technology 
(and amortizing the cost for its development as rapidly as possible) requires 
optimizing the circuits by using suitable and typically different transistor 
sizes and configuration for the different applications even within the circuits. 
This requires to cover a certain range of device sizes during parameter 
extraction. Third, utilizing device size adds another independent dimension 
and set of data points for determining the unknown parameters and increases 
the chances for obtaining physics-based parameter values. The number of 
independent data can be increased by adding special test structures to the 
(test) transistors. Fourth, only a physics-based set of parameters maximizes 
the use of a physics-based CM by enabling statistical and predictive circuit 
design and modeling. 

An extraction follows a certain procedure that is preferably designed such 
that as many as possible model parameters are determined independently. 
The description below will reflect that sequence, starting with an overview 
of series resistance determination. Then, the methods for those parameters 
are covered that have been introduced in the extended equations. Finally, an 
extended concept for geometry-scalable parameter extraction is described. 

Compact models are supposed to represent the typical characteristics of 
a process. For their first delivery along with the process qualification only 
limited statistics are available which need to be utilized though for selecting 
a proper die for parameter extraction. Since it is unlikely that all process 
control monitor values of an available die match all their nominal values from 


Reasonable results have been obtained though for at least all parameters related to the 
internal transistor after careful deembedding of all external elements and heavy utilization of 
optimization [Ros13]. 
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wafer tests, the extracted model parameters need to be “shifted” properly 
(e.g., [SchO5]) to the nominal values. This is possible only with a physics- 
based model or, more precisely, with the associated model parameters having 
a clear physics-based relation to process parameters. 


3.6.1 Extraction of Series Resistances 


Preferably, a series resistance can be determined from its components as 
shown in Figure 3.20 for the example of the base region. Each different 
structural region in the cross section is represented by a resistor element. The 
value of the latter can always be calculated from a specific resistivity, the 
length of the region in direction of the current flow (b), and its cross-sectional 
area. Except for contacts, semiconductor layers are typically characterized by 
the sheet resistance rg, which eliminates the need for knowing the spatially 
dependent vertical doping profile and reduces the cross-sectional area to 
the lateral layout dimension (/) perpendicular to the current flow. Contact 
resistances are calculated either from an area-specific resistivity (in Qum?) if 
the current crosses the contact cross-sectional area vertically or from a length- 
specific resistivity (in Qum) if the current flows to the side. The contact in 
Figure 3.20. belongs to the latter category. Thus, the overall external base 
series resistance of the structure example in Figure 3.20 reads: 


OB, b 

Rpx < + rBSpo7 — + TBS pm — + TBSI7— (3.42) 
[Be lB ,po IB pm le 
“~~ SSS 


Contact poly-Si on oxide mono-Si on poly-Si Link (spacer) 


where pg, is the base contact resistivity, rps ab is the sheet resistance of the 
region “ab,” and the bap and lab, respectively, are the widths and (effective) 
lengths of each region. 

The form (3.42) allows to calculate the base series resistance for all sizes 
and even for changes in the dimensions and doping profiles as they occur 
during fabrication (resulting in process tolerances) and process development. 
A general and accurate formulation of (3.42) can be found in [Sch08a, 
Sch10], which is applicable to all common contact configurations and SiGe 
HBT architectures. Obviously, to employ (3.42) for generating Rgx for a 
given HBT device size, the parameters of each component need to be known. 
The determination of the actual (i.e., not drawn) dimensions can be obtained 
from TEM and SEM measurements. Once the process is qualified, the relation 
between each actual and drawn dimension can be established, so that for 
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Figure 3.20 Illustration of the base series resistance components and their relation to the 
HBT cross section (schematic). 


model generation the (drawn) layout dimension can be used. The specific 
contact and sheet resistance can be determined based on very simple (DC) 
test structures [Sch88, Sch08b] as shown in Figure 3.21 for the example of 
the contact and poly-on-mono region. Forcing a current from B1 and B3 
and measuring the voltage drops between (a) B1 and B3 and (b) B1 and 
B2 using a Kelvin contact configuration gives two resistance values for the 
two unknowns pg, and Rgs,po. Extending the structure to include more base 
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Figure 3.21 Typical test structure (a.k.a. contact chain) used for determining the specific 
electrical resistivities of the external base resistance components. B1, B2, B3 designates the 
contacts. 
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layers (as in Figure 3.20) allows the successive determination of all other 
sheet resistances. The principle described above can be applied also to the 
determination of the components of the external collector resistance. 

From transistor theory (e.g., [Pri67, Sch10]), the bias-dependent internal 
DC base resistance is generally given by 


Rpgi = rsi e Silbe, le) Pac(IBi, rsBi, be, lE), (3.43) 
where rgpgi is the sheet resistance of the internal base region (i.e., under the 
emitter), bg (lg) is the (effective) emitter width (length), Ip; is the internal 
base current, g; is a geometry factor that takes into account all common emit- 
ter shapes [Sch91, Sch92, Sch10], and Wac is the emitter current crowding 
function. The latter can be neglected (i.e., Wace = 1) for all modern SiGe 
HBT process technologies. For a given HBT size, only the zero-bias sheet 
resistance 


Ww 
RBio = rsBio Fe see lp), (3.44) 


is required as parameter in HICUM. Hence, rgpio needs to be extracted from 
measured data. The test structure of choice here is the transistor tetrode 
[Sch88, Rei91, SchO7]. The principle method for extracting rgp; over a wide 
reverse and forward bias range under simultaneous transistor operation is 
described in detail in [Rei91]. There have been other proposals on how to 
utilize the tetrode for determining rgp; or Rp. Most recently [Sch17], the 
existing methods have been applied to various advanced process technolo- 
gies. The corresponding comparison clearly indicates that the method in 
[Rei91] is the most accurate over the widest bias range (including even high 
the high-current region). 

Furthermore, the extracted bias-dependent sheet resistance of the internal 
base, rgpi, is utilized to determine the zero-bias hole charge of the transistor 
Qpo, which is required for the accurate calculation of the transfer current. Due 
to the Early-effect and injected minority charge, the base sheet resistance 


Qpo 
Qpo + Qiri + Qjci + Qe’ 


is modulated by the voltages of both SCRs through the depletion charges 
Qjri and Qjc; and the diffusion charge Qr. Note that above equation is 
slightly simplified by neglecting the bias dependence of the hole mobility. 
Figure 3.22. shows the extracted rgpj curves in the low bias range for two 
different technologies. In each case, the different bias conditions lead to a 


rsBi = TSBio (3.45) 
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Figure 3.22 Internal base sheet resistance, normalized to its zero-bias value rsgio, as 
extracted from tetrodes for different bias conditions and technologies. 
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different contribution of the depletion charges in (3.45), while mobile charge 
is negligible in al cases. The stronger impact of the BE junction charge 
compared to the BC junction charge is clearly visible by the larger slope for 
Vgc = 0 V. Also, for the technology on the r.h.s. a larger impact of the space 
charges on Qpo, can be observed, indicating a lower base doping at a similar 
width. 

While it is possible for BJTs (i.e., bipolar technologies until the early 
1990s) to determine the emitter resistance from dedicated test structures, 
this has become impossible for modern SiGe HBTs. The reason for this is 
that the by far largest contribution to the emitter resistance still comes from 
the interface between the poly- and mono-silicon, but that its formation is 
intrinsically coupled to the emitter (poly-)layer formation; i.e., there is no 
separate emitter implant anymore. Therefore, the emitter resistance has to be 
measured directly on a HBT structure (see below). But this should be done on 
various geometries in order to obtain the simple and usually sufficient scaling 
equation, 

Rg = ppg,c/A£o (3.46) 


for a single emitter window. Here, pp, is the poly-to-mono-Si contact resis- 
tivity and Apo is area of this interface, which corresponds to the area of the 
actual emitter window opening. 

Some modelers still question the accuracy of transistor theory and prefer 
to determine series resistances directly from a given HBT structure. Over the 
past >60 years of bipolar transistor technology development, many different 
methods were proposed. There often quite different results can become 
confusing especially for young modeling engineers and the question arises 
which of these methods work at all and which ones are actually applicable 
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to advanced SiGe HBT technologies. To answer these questions, recently 
several studies on the extraction methods for base, collector and emitter 
resistance have been completed, in which the various methods were applied 
to SiGe HBTs with widely varying emitter sizes from six different process 
technologies, ranging from established production to the most advanced 
prototyping processes. In all cases, the (absolute) accuracy of each method 
was assessed by applying it to a complete HICUM/L2 model for the particular 
process and comparing the obtained resistance with the known one of the 
model. This approach also allows the investigation and identification of the 
causes of observed failures. The results are briefly summarized below; for 
detailed results, the reader is referred to the corresponding references. 

In [Kra15], a comprehensive and detailed study of nine widely used meth- 
ods (and their variants) for extracting the emitter resistance Rp is presented. 
Using high-performance and high-voltage devices with a wide range of up to 
12 emitter sizes, the results of this study are believed to be representative for 
the actual accuracy and applicability of the various Rg extraction methods. 
It was found that none of the existing methods works reliably across all pro- 
cess technologies. The most important causes of deviations are the strongly 
simplified equivalent circuit and the neglect of important physical effects 
(such as high-injection, CB barrier effect, self-heating) in the derivation of the 
methods. The two methods working mostly are the ones in [Paw14b, Hue04], 
but the one in [Paw14b] is based on the occurrence of self-heating. Methods 
based on fly-back/open-collector and impact-ionization, which increase the 
risk of device destruction, are not reliable in practice. This also applies to HF 
small-signal methods, where the impact of Rg is masked by the much higher 
base resistance. The Ning-Tang method is the least reliable of all methods. 

A detailed quantitative analysis of the most widely used methods for 
determining the base resistance directly from a transistor was performed in 
[Paw16] for a wide range of emitter geometries. The CM-based assessment 
clearly revealed that all methods only enable the extraction of the external 
base resistance, while the determination of the internal (bias dependent) base 
resistance is either impossible or, at best, limited to just a very narrow bias 
range, typically at very high injection, and are not very accurate. Small-signal 
HF methods, when operated at very low Vcg values, yield the best results, 
although still not with reliable accuracy across all technologies. A major 
cause of the failure or inaccuracy of the DC-based methods is self-heating. 
Thus, the use of DC operation-based methods cannot be recommended since 
self-heating will rather increase in future technologies. The application of the 
extraction methods to experimental data confirmed the large spread in the 
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Rpx results (for the same transistor structure) that was already observed from 
the CM. From both experimental and model based data, it was found that 
although some methods work reasonably well for some process technologies 
no method yields reliable accuracy for all technologies. Overall, it can be 
concluded that an accurate determination of both the total base resistance Rp 
and the external base resistance Rpx from widely used single-transistor-based 
methods is highly unreliable even if small-signal methods are employed. 

Finally, a comprehensive and detailed study of eight widely used methods 
(and their variants) for extracting the DC collector series resistance was 
presented in [Paw18]. Again, no method yielded accurate and reliable results 
across all technologies. But RF methods that rely on just a simple equivalent 
circuit, some of the substrate transistor-based methods, and the open-emitter 
method, yield overall the best results. The most important causes of devi- 
ations are the neglect or too strong simplification of the description of 
important physical effects (such as self-heating, high-injection). Note, the 
none of the methods yields any reasonable result for the bias-dependent 
internal collector resistance. 

The recommendation for all single HBT-based series resistance extraction 
methods is to apply them to the utilized CM again in order to verify whether 
the same result is obtained. If not, then the method is unsuitable for the chosen 
model (equivalent circuit). 


3.6.2 Extraction of the Transfer Current Parameters 


The extraction of the parameters for the transfer current of HICUM/L2 can be 
applied based on the methods described in [Ber02, Paw11]. The usual method 
is to consider different operating ranges, where only a single or very few 
unknown parameters exist. The most convenient way is to start with the low 
current weight factors hjg; and hjc; for the depletion charges. The extraction 
of the latter is not discussed here further. 

For Vgc = 0 V and low Vpgg, where the voltage drop across the series 
resistances is negligible, the DC collector current is expressed by 


C10 VBE 
Ic = Ir = ——————— exp (=) š (3.47) 
Qpo + hjeiQjri Vr 
The method described in [Ber02] rewrites the above equation as 
exp(Vse/Vr) _ Qpo z hjg 
Ir Cio  &0 


Oy = f(Qjzi)- (3.48) 
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Figure 3.23 Extraction of hjg; based on (3.48). 


Using the physics-based value for Qpo extracted from tetrodes, the parameters 
cio and hjg; can be extracted from the slope and intercept of the above 
function f(Q;ri). Results for applying this method to transistors with a graded 
Ge profile in the base are visualized in Figure 3.23. As can be seen, the data 
obtained from measurements do no yield a straight line as expected from 
(3.48) but show a curvature. Depending on this curvature, the extraction may 
yield a non-physical negative value for the intercept with the y-axis, given 
by Qpo/cio. The curvature is caused by the non-constant hjp; present for 
transistors with a graded Ge profile. Hence, according to (3.19), the parameter 
dpjzi enters into (3.48) as an additional unknown. While the method in 
[Paw11] works reliably for numerical device simulations due to the optimal 
accuracy of the results (i.e., no impact of measurement noise), results may 
become unreliable due to small noise in the measurements. The method in 
[Ste12] is based on a known saturation current and a normalized hjp;, which 
are not so simple to extract without known apņjFi. 

In this section, an alternative method is presented which is based on 
[Ste12] without involving the additional unknown. Rather than performing 
a linear fit, taking the derivative of (3.48) leads to the differential equation 


df (VBE) QjEi 
= eee ae 4 
C = f (VBE) + Wee On (3.49) 
with ee 
exp(VBE/Vr 
A q | xp(Vep/Vr) 
f(Vep) = SS VBE) oad OS | a | (3.50) 


C10 dQ ni 
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This differential equation can be solved numerically, requiring an initial value 
fı = f(Vsrı) with Vgpı being the minimum Vpgpg with reliable current 
values. For calculating apjp;, (3.19) is altered into 


ZEi ZEi 
exp (am (E) -Ue ©) -1 


Í = fi Zhi ZBi 
i Várni Ei 
anges (4) - (1 BE)” ) 
exp(u)—1 
= es 6.51) 
u 
i.e., shifting the reference from Vpp = O, as it is the case in (3.19), to 
Vg = Vegi. Note that cio is a constant value and, therefore, does not 


change the shape of the resulting curve. Defining w = f/f; according to 
[Stel2] allows the calculation of u by 


u= : wat ~ exp ( E), (3.52) 
w w w 


with W_, being the negative branch on the Lambert-W function, and finally 
ahjEi by 


U 
QniEi = z : 
hjEi (1 Way ) ZEi (1 Wisi ) ZE? 


VDEi VDEi 


(3.53) 


The form of the extracted curve for f from (3.49) strongly depends on the 
chosen initial value f1. Therefore, if the shape does not agree with that of the 
model equation, a non-constant apjr; is extracted. 

The actual extraction therefore is based on an iterative change of fı until 
the relative standard deviation of aņjgi is minimized, i.e., the most constant 
ahjFi is obtained. A bi-section method is a reliable choice for the algorithm 
here. The application is visualized in Figure 3.24. Usually, for too small 
values of f1, a decreasing curve of apjRi is obtained while too large values 
of fı lead to increasing values. The center curve in the plot shows the actual 
solution of the iteration. 

The remaining steps for extracting hjgio and c10 follow [Paw11]. Utilizing 
the extracted aņjri, (3.48) is altered to 


exp(VpE/Vr) 
Ir 


with h = hjri/hjrio from (3.19), where aņjri 1S the only parameter entering, 
which resolves the bending of the extraction curve and leads to the correct 
signs of the extracted parameters as demonstrated in Figure 3.25. 


= f(hQjri) (3.54) 
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Figure 3.24 Extraction results for anjgi based on (3.53) for different starting values f1. 


A linear extrapolation finally allows extracting c10 and hjgio from 
0 m 
aes S ndo Og (3.55) 
Yo Yo 
with m being the slope of the straight line and yo the intercept with the 


y-axis. The extraction is performed at low injection, where series resis- 
tances and self-heating have only negligible impact during extraction of hjp; 
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hQjni/{F 
J 
Figure 3.25 Extraction of hjg; based on (3.54) including the correction based on the 
extracted ahjEi. 
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Figure 3.26 Extraction results and application of the temperature model for anjri (3.31) and 
hjgio (3.29). 


and c10. Therefore, the values for anjfi, hjFio, and cio are extracted for the 
given ambient temperature. Results are shown in Figure 3.26. 

The extraction of the high-current weight factors is performed by step- 
wise inclusion of the related diffusion charges from (3.9). For obtaining 
correct results, the voltage drops across the series resistances as well as the 
actual device temperature due to self-heating have to be calculated. Starting 
with the low-current minority charge to/7¢, the weight factor is calculated 
from 


hy = 2° exp (Be 
Vr 


> ) - (Qpo + hjkiQjri + hjciQjci). (3.56) 


The application of above equation to data in Figure 3.27 shows a bias 
dependence of hfo, which has to be obtained at low current densities though, 
yielding the results shown in Figure 3.27(b). The increase at high current den- 
sities in Figure 3.27(a) is caused by the so-far-neglected charge components 
and does not yield the correct ho. 

The temperature dependence of hto in Figure 3.27(b) exhibits visible steps 
between the values of different temperature ranges. They are caused by the 
Vgc dependence of hto. Extracting hyo for a given ambient temperature yields 
the results displayed as crosses in Figure 3.28. Since these values are still 
affected by the self-heating-related temperature increase inside of the device, 
applying the temperature dependence given in Figure 3.27(b) and correcting 
only the changes due to self-heating effects yields the results given by circles 
in Figure 3.28 which still displays a weak bias dependence. 
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Figure 3.27 Extracted values for hgo from (3.56) for (a) room temperature and different 


Vac. (b) Extracted values chosen at low current densities for different temperatures and Vgc 
[Paw 15a]. 
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Figure 3.28 Extraction results for the bias-dependent hto at room temperature for the results 
given in Figure 3.27(b) [Paw15a]. 


The extraction of hfg follows the same steps by further including 
heg tol re into (3.56), thus calculating hfg from 


_ C10 Vee 
= —exp Vir 


) —(Qpo + hjkiQjri + hyciQjci + htottolrs). (3.57) 


The data in Figure 3.29(a) initially show a strong dependence on Ir and 
Vgc, which is caused by self-heating and the temperature dependence of 
hep. After taking into account the temperature dependence according to 
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Figure 3.29 (a) Extracted hfg values at room temperature for different Vgc as indicated by 
the arrow. (b) Extracted temperature dependence [Paw 15a]. 
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Figure 3.30 (a) Extracted values for hfc at room temperature for different Vgc as indicated 
by the arrow. (b) Extracted temperature dependence [Paw 15a]. 


Figure 3.29(b), the bias dependence of hfg almost vanishes (see solid curves 
in Figure 3.29(a)). 

As the final step, htc is extracted analogously by also including htp AQ Es 
and AQ@pr into the calculations. Note that AQpr is weighted by 1. Also for 
htc, a strong bias dependence is observed initially, which disappears though 
after correctly including the effect of self-heating. 


3.6.3 Physics-Based Parameter Scaling 


HICUM/L2 has emerged as one of the industry standard bipolar transistor 
models and was therefore selected as the backbone of the compact transistor 
modeling strategy in DOTSEVEN. From its first development on in the 
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1980s, the model has been formulated with geometry scaling capability in 
mind since this feature has been crucial to achieve the optimum circuit 
performance for a given process technology. Geometry scaling is fundamen- 
tally based on a physics-based model formulation and parameter extraction 
strategy. Unlike compact MOSFET models, bipolar transistor models do not 
include scaling equations in their simulator code for several reasons. First, 
the large variety of possible contact arrangements and structures makes such 
equations complicated. Second, the accurate calculation of the impact of 
some effects, such as the geometry dependence of the substrate and thermal 
coupling, requires the solution of implicit or even differential equations 
which are difficult to program in simulator-supported description languages 
including Verilog-A. 

Therefore, the development and implementation of HBT geometry scal- 
ing formulations is typically left to the foundry or model user. Due to 
the physics-based formulation of HICUM/L2, its important model param- 
eters can be expressed readily in terms of transistor geometry. Within 
DOTSEVEN, significant effort was spent on further improving the model’s 
geometry-scaling capabilities for most advanced SiGe HBT structures and for 
developing reliable geometry-scalable parameter extraction methodologies. 
Besides enabling the selection of the optimal transistor size for any given 
circuit application, another benefit is the reduction of the so-called “parameter 
extraction noise.” The latter is a well-known and undesired effect that results 
in erratic and unpredictable parameter variations with respect to transistor 
geometry. It occurs when correlated model parameters are extracted on a 
set of individual transistors by numerical optimization. Geometry-scalable 
parameter extraction methods have the additional advantage of smoothing 
random measurement errors and allowing the detection of systematic mea- 
surement errors due to, e.g., measurement (equipment) limitations as well as 
test structure layout and process issues. Finally, scalable models ensure that 
important model parameters (like the base resistance) behave properly with 
respect to geometry. 


3.6.3.1 Standard geometry scaling equation 

The most important concept regarding the scaling equations used in con- 
junction with HICUM/L2 is probably the concept of an effective (electrical) 
emitter area. At low current densities, (i.e., for negligible voltage drops 
across series resistances), the total collector current can be split into an 
internal and perimeter portion, each related to a specific emitter region 
(see Figure 3.31). 
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Figure 3.31 The collector current flow in the emitter can be split into an intrinsic portion 
related to the emitter area and a portion related to the perimeter only. The right picture 
shows the spatial distribution of the vertical electron current density. Marked are the actual 
emitter width bgo as well as the effective emitter width bg, calculated from beo and 
yc (cf. 3.58, 3.60). 


Assuming no impact of collector avalanche or tunneling effects, this 
graphical concept can be expressed mathematically by 


Ic = Ica Aro + Icp Pro, (3.58) 


where Ago = bgoleo is the actual emitter window area and Pgo = 2bgo +2lE0 
is the actual emitter window perimeter*, both given by the window open- 
ing and interface area of the emitter poly-silicon with the mono-silicon 
region. Furthermore, Jc, Ago and Icp Pro, respectively, are the collector 
components resulting from carrier injection across the emitter window area 
and window perimeter, respectively. Equation (3.58) can be conveniently 


reformulated as a 
EA E (1 i jo) 
Ica Aro 


By introducing the process-specific parameter yc, defined as the ratio 
of perimeter-specific to area-specific collector current, (3.59) defines the 
effective electrical emitter area 


(3.59) 


P 
Ap = Axo (1 T 7) , (3.60) 


4Simplified equation neglecting corner contributions and possible corner rounding. 
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Figure 3.32 Schematic illustration of the effective emitter area concept. The area and 
perimeter currents are gathered in a single effective contribution. 


which reduces the two current components of (3.58) to a single component 
with a clearly defined geometry dependence. Therefore, as can be seen from 
Figure 3.32, this approach allows to represent the internal and perimeter 
transistor by a single-transistor model having an effective emitter area Ap. 
This is obviously advantageous over a two-transistor model approach in terms 
of computational efficiency and parameter extraction effort. This concept can 
be easily extended to the modeling of other current components as well as to 
the charges (and capacitances) of a transistor structure. 

According to (3.59), plotting I¢/Ago versus Pao /Ago allows to extract 
the geometry scaling-related parameters Ica and yc from the y intercept and 
the slope of the curves. This procedure is also known as the P/A (perimeter 
over area) approach. The application of this concept to experimental data 
of a DOTSEVEN process in Figure 3.33 displayed the excellent scalability 
of the collector current. Notice that the use of data from several structures 
helps averaging out local process variations as compared to performing the 
extraction from just a single device. 


3.6.3.2 Generalized scaling equations 

During the various projects (DOTSEVEN, DOTFIVE, RF2THz) deviations 
from the P/A scaling were observed for some process versions, [Paw13, 
Paw15b]. It turned out though that such non-standard scaling behavior 
could be captured by an extension of the standard P/A approach. Figure 3.34 
depicts schematically this extension. Again, the goal is to combine the models 
associated with each lateral region of the transistor into a single model rep- 
resenting a transistor with an effective emitter area Ap. The extension relies 
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Figure 3.33 Experimental data of Ic/Aro versus Pro/Axro for Vgs = 0.45-0.5 V in steps 
of 10 mV. Aro and Pro, respectively, are the actual emitter window area and perimeter, 
respectively. The drawn emitter dimensions are (0.31, 0.35, 0.4, 0.53, 0.7, 1.2, 2.2)x 10 jrm?. 
Symbols represent measured data and dashed lines results from linear regression. 


on considering the four different components, defined by an injection across 
the window area, width and length related perimeter junctions, and corner 
junctions, separately. Obviously, the simple P/A approach in Figure 3.32 is 
just a special case of this extended generalized linear scaling approach and 
is obtained when the specific currents related to the width, length, and corner 
are merged into a single perimeter related specific current. 

In order to properly extract the scalable model parameters for the general- 
ized scaling approach, a matrix of test structure is required with the same set 
of emitter widths for at least two emitter lengths (see Figure 3.35). From the 


Figure 3.34 Schematic illustration of the generalized effective emitter area concept. 
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Figure 3.35 Required matrix of test structures for scalable parameter extraction in the case 
of the generalized scaling laws. 


corresponding measurements, one can obtain a complete set of parameters 
for each of the different transistors (and associated lateral regions) depicted 
in Figure 3.34. These parameters then need to be transformed into a set for a 
single-transistor model. This has been implemented with the help of a Taylor 
series expansions in the geometry scaling equations for HICUM/L2. 


3.7 Compact Model Application to Experimental Data 


Within DOTSEVEN and related research projects (such as DOTFIVE, 
RF2THz), HICUM/L2 model parameters were determined from the electrical 
characteristics measured for many process runs and technology versions. 
Since the vast amount of data and comparisons cannot be displayed here 
due to the lack of space, just an overview is given that is based on selected 
publications’ and the complete version of HICUM/L2 with all the previously 
described extensions. 

An overview on the overall parameter extraction approach and employed 
procedures is given in [Pawl1], while [Paw14a] highlights the improve- 
ments in modeling the transfer current of SiGe HBTs using the extensions 
described in Section 3.3 and providing a guideline for extracting the new 
parameters. Very detailed information on both parameter extraction and 
model comparisons for different process technologies have been given in 
[Paw 15a][Kral5b][Ros17]; these include a large number of DC, AC and 


Note that — in contrast to circuit design — the (successful) results of compact modeling can 
typically not be published for subsequent (and intermediate) process technology versions with 
improving electrical performance. 
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large-signal results for a large variety of transistor structures of the technolo- 
gies developed in different research projects. 

A general overview on the modeling results of DOTSEVEN was given 
in [Paw15b] focussing on a veriety of characteristics and operating condi- 
tions. The application of the compact model with a focus on decomposing 
the impact of different physical effects for guiding process technology 
development has been given in [Paw13], [Kor15], [Paw17a] and [Paw17b]. 

High-frequency noise, including the correlation between collector current 
and stored base charge and its generic implementation in circuit simulators, 
was discussed in [Her12]. There, the applicability of the noise correlation 
formulation in HICUM/L2 was verified based on the results of the Boltz- 
mann transport equation for frequencies at least up to 500 GHz. Noise 
measurements at such frequencies are presently impossible so that experi- 
mental verifications have been restricted to 50 GHz so far [Her12], [Sak15b], 
[Sak15]. Here, the accuracy of the model has allowed the decomposition of 
the various physical noise mechanisms within the transistor, yielding valuable 
insights into their magnitude and relative importance. 

Applications of the compact model with a focus also on circuit results 
have been demonstrated in [Ard15][Sch16c][Lial7][Sch1 8b]. 
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4.1 Introduction 


High-frequency characterization of active and passive devices is carried out 
by extracting the scattering parameters of the component (often in a two- 
port configuration) employing a vector network analyzer (VNA). This class 
of instruments allows to characterize the response of the device under test 
(DUT) over a broad frequency range (exceeding 1 THz [Dio17]) at a user- 
defined reference plane. In order to define such reference planes and remove 
all the imperfections of the measurement setup (i.e., cable and receiver 
conversion losses, amplitude and phase tracking errors, and other statisti- 
cal errors), a calibration procedure [Ryt01] needs to be carried out prior 
to the measurement. The calibration procedure employs the knowledge of 
the devices used (i.e., standards) to solve the unknowns representing the 
measurement setup response (often referred to as error terms). The derived 
error terms allow then to remove the imperfections of the setup, during 
the measurement procedure. The accuracy of the calibration is then directly 
dependent on the accuracy with which the standards are known [Stu09]. In the 
literature, different calibration techniques have been presented, often trading 
off (more) knowledge on the response of the standard device for (lower) 
space occupancy (i.e., when considering SOLR/LRM [Fer92; Dav90] cali- 
brations versus TRL type ones [Eng79]). Traditionally, calibration techniques 
requiring little standards knowledge (e.g., TRL, LRL) have been considered 
the most accurate, with TRL reaching metrology institute precision, by only 
requiring the information of the characteristic impedance of the line [Eng79]. 
In this chapter the focus will be placed only on TRL calibration techniques 
due to their best compatibility with millimeter- and sub-millimeter-wave 
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characterization. For a more extensive discussion on the various possible 
calibration techniques the reader is referred to [Tep13]. Calibration tech- 
niques for on-wafer measurements typically consist of a probe-level cali- 
bration (first-tier) performed on a low-loss substrate (i.e., alumina or fused 
silica) [Eng79; Eul88; Dav90; Mar91a]. This probe-level calibration is then 
transferred to the environment where the DUT is embedded in and often, 
to increase the measurement accuracy, this calibration is augmented with a 
second-tier on-wafer calibration/de-embedding step. This allows moving the 
reference plane as close as possible to the DUT, by removing the parasitics 
associated with the contact pads, the device-access lines, and the vias [Tie05]. 
In this chapter we will first review the challenges and potential solutions 
associated with first-tier calibrations performed on low-loss substrates, then 
the approach to design calibration kits integrated in the back-end-of-line 
of silicon based technology will be presented, and finally a direct de- 
embedding/calibration strategy, capable of setting the reference plane at the 
lower metal layer of a technology stack, will be described. 


4.2 Multi-mode Propagation and Calibration 
Transfer at mm-wave 


The different propagating modes supported by a coplanar wave guide (CPW) 
are qualitatively sketched in Figure 4.1. The CPW mode, characterized by 
opposite direction of the fields across the slots, represents the intended 
propagation mode and is often referred to as CPW differential mode. The 
CPW mode characterized by in-phase direction of the fields across the 
slots (Waap) represents an unwanted radiating mode and is often referred 
to as CPW common mode. The TM, and TE,, modes are surface waves 
propagating along the grounded dielectric slab. Their cut-off frequencies 


—» CPW even mode — — m TM, surface mode 
---» CPW odd mode —--* TE, surface mode 


Figure 4.1 Cross section of a coplanar wave guide (CPW) with finite ground planes, and 
sketches of the E field distributions of the first propagating modes supported. 
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(n > 0 and m > 1) are functions of the height and dielectric constant of 
the substrate [Poz04]. The overall effect of the unwanted modes described 
above is an increase of the transmission line losses (i.e., |S21|) and the gen- 
eration of ripples on the transmission parameter (i.e., S21) of the CPW. The 
ripples are the results of interference (constructive or destructive depending 
on the frequency) between the unwanted modes, reflected by discontinuities 
(i.e., dielectric constant changes), and the intended CPW mode. The lines 
conventionally employed for probe-level TRL calibrations, are: 


e The thru standard: A CPW line with a physical length in the order of 
200-250 um, 

e The line standard: A CPW line providing an insertion phase of 90° at 
the center of the calibration band. 


The analysis presented in this section is based on numerical 3D EM 
simulations, i.e., using Keysight EM Pro. 


4.2.1 Parallel Plate Waveguide Mode 


During the calibration procedure the substrate is placed on a metallic wafer 
chuck, creating effectively a grounded coplanar waveguide (GCPW) struc- 
ture, as shown in Figure 4.2(a—c). This structure supports, in addition to the 
modes shown in Figure 4.1, also a parallel plate waveguide (PPW) mode. 
This occurs since the top (CPW line) and bottom (chuck) metal are not 
directly contacted, thus a different potential can exist and propagate. The 
PPW mode can be visualized by plotting the E field intensity below the 
metal surface, as shown in Figure 4.2(c—d). In the figure the intensity of the 
E field is acquired on the xy plane placed below the metal plane (i.e., 5 uym). 
Note, both plots use the same range for the field intensity (blue = minimum, 
red = maximum) to allow for a direct visual comparison. Conventionally to 
reduce the PPW mode propagation, an interposer substrate of ferromagnetic 
material (i.e., providing high losses for the EM waves) is used between 
the calibration substrate and the metal chuck. Figure 4.2(d) shows a partial 
reduction of the PPW mode when simulating with the absorber structure. 
Alternatively, dielectric chucks with a permittivity similar to the one of the 
calibration substrate can be used to remove the occurrence of the PPW mode. 


4.2.2 Surface Wave Modes: TMo and TE: 


The overall loss behavior of the CPW structure, including the surface waves 
when fed by a wafer probe, can be analyzed by using the 3D simulation 
environment shown in the inset of Figure 4.3(a). A point voltage source with 
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(a) (b) 


(c) (d) 
Figure 4.2 Cross section of CPW placed (a) on metal chuck, (b) on absorber. Electrical field 


intensity below the CPW metal plates (5 um) for case (c) no absorber and (d) with absorbing 
boundary conditions in the 3D FEM simulation, both fields were computed at 180 GHz. 


G 5 G 
Calibration substarte 


a source impedance of 50 Ohm is applied to the bridge to provide a transition 
similar to a wafer probe. In the 3D simulation environment, the boundary 
conditions were set to absorbing, thus providing perfect match condition to 
all the unwanted modes within the structure. 

Figure 4.3(a) compares the insertion loss of the CPW structure realized in 
alumina when the substrate (sub) is enlarged and an air gap (GAP) is applied 
between the substrate boundary and the radiation boundary of the box, see 
Figure 4.3(b). Note that the multiple reflections of the unwanted modes within 
the structure generate an interference pattern (dependent on the distance to 
the discontinuity) along the trace, as can be seen by the shift of minima and 
maxima points when the sub-parameter is changed. The simulation does not 
include conductive or dielectric losses thus the decrease in the transmission 
parameter S21 in Figure 4.3(a) can only be attributed to energy dissipated in 
the other modes supported by the structure. When considering real structures 
on alumina substrate (i.e., exhibiting also dielectric losses), it is expected 
that the lines closer to the edge of the calibration substrate will exhibit 
a stronger ripple caused by interference with the surface wave mode. In 
Figure 4.3(b) structures with different distance to the substrate edge where 
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(c) 
Figure 4.3 (a) Simulated S21 of CPW on alumina substrate for various cases: CPW no GAP 
sub = 0 wm GAP = 0 um, CPW GAP sub] sub = 320 um GAP = 500 um, CPW GAP sub2 
sub = 420 um GAP = 500 um; (b) CPW structure used in the EM simulator with highlight 
on the lumped bridge configuration; (c) measurement of different (4) thru lines on alumina 
substrate in different locations of the calibration substrate. Locations (i.e., two middle and two 
center) identified in the inset on top right. 


measured (i.e., center and edge) for the alumina substrate. As can be clearly 
seen by the figure, the structures at the edge of the substrate exhibit a clear 
interference pattern, as predicted by the simulation analysis. 


4.2.3 Electrically Thin Substrates 


Employing lower €, substrates shifts the occurrence of the TM; and TE 
modes to higher frequencies, and reduces the amount of energy radiated 
by the CPW common mode, for a given frequency, due to the smaller gap 


168 (Sub)mm-wave Calibration 


dimension for a given signal width. For these reasons, fused silica €, can be 
considered as a good candidate to integrate CPWs to perform TRL calibration 
in the (sub)mm-wave bands. The same simulation analysis performed for the 
alumina case in Figure 4.3(a) was carried out for the fused silica substrate, 
see Figure 4.4(a). As can be seen by the plot a considerably lower amount of 
energy is transferred to other modes. Moreover, the lower dielectric constant 
of the substrate provides lower discontinuities when terminated with air, 
showing close to no-variation when performing a simulation varying the 
dimension of the parameters sub and GAP, see Figure 4.4(a). 

The measured results are then compared with the simulation showing 
very good agreement in WR3 band, as shown in Figure 4.4(b), confirming 
also the low loss achieved by the CPW realized on fused silica. Note, that 
the deviation that can be observed between measured and simulated data 
above 290 GHz can be explained with reduced sensitivity of the measurement 
equipment, closer to the edge of the specified band (i.e., WR3 220-325 GHz) 
and the onset of unwanted modes in the fused silica substrate. 


4.2.4 Calibration Transfer 


In the previous paragraph the usage of electrically thin substrates was 
introduced to overcome the limitations exhibited by commercially available 
calibration devices operating in the mm-wave bands. While using such sub- 
strates (i.e., fused silica) improves the calibration quality, an important point 
is that the measurement quality will also depend on the error introduced by 
transferring the calibration to the environment where the DUT is embedded. 
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(a) (b) 
Figure 4.4 (a) Simulated S21 of CPW on fused silica substrate for various cases: CPW no 
GAP sub = 0 um GAP = 0 um, CPW GAP subl sub = 320 um GAP = 500 um, CPW 
GAP sub2 sub = 420 um GAP = 500 um; (b) measurement versus simulation of a thru line 
on fused silica substrate. 
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It is often the case that the DUT is embedded in a different host medium 
compared to the calibration, i.e., Si, SiO2, GaAs, or other substrate materials. 
When the measurement is performed on the new host medium, a different 
probe to substrate interaction will occur, which would not be corrected for by 
the calibration. This will introduce a residual error that would be a function 
of the difference in permittivity between the two substrate materials (i.e., 
calibration and measurement). In a first-order approximation, the calibration 
transfer effect, associated with the change in the error box, can be seen as a 
capacitive coupling between the probe tip and the substrate, as schematized in 
Figure 4.5. This capacitance can be found using a numerical optimizer when 
the probe geometry is partially known, allowing to minimize the calibration 
transfer error in the measurement frequency band. 

Figure 4.6 shows the results, in terms of worst case of the error bound 
[Wil92], when transferring the calibration from the primary calibration envi- 
ronment (i.e., alumina and fused silica) to a verification line embedded in 
the back-end-of-line of a SiGe high-speed process. The calibration quality is 
evaluated before any optimization is applied (full symbols and solid lines) 
and after the application of the correction (empty symbols, dotted lines). 
The maximum value for the error associated with the calibration of alumina 
decreased from 0.12 to 0.06, with an improvement noticeable over the entire 
bandwidth. However, no significant improvement is obtained for the fused 


Figure 4.5 Schematic representation of the capacitive coupling between the probe tip and 
the substrate where the device under test (DUT) is embedded. 
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Figure 4.6 Worst case error bound for calibration transfer from fused silica and alumina to 
SiGe BEOL, before correction (full symbols, solid lines) and after correction (empty symbols, 
dotted lines), obtained with on-wafer measurements on a 600 ym CPW line manufactured on 
IHP SiGe 130 nm BiCMOS technology, in the frequency range from 75 to 110 GHz. 


silica, where the error associated with the difference in substrate coupling due 
to calibration transfer is small, due to the similarity of permittivity between 
the fused silica and the silicon dioxide present in the back-end-of-line of the 
process. 


4.3 Direct On-wafer Calibration 


In order to avoid the error arising from the process of transferring the 
first-tier calibration to another environment, the calibration kit should be 
implemented in the same environment as that of the DUT. Classical probe- 
level and on-wafer calibration techniques are based on (partially) known 
devices and lumped models of the DUT fixture (i.e., SOLT/LRM and 
lumped de-embedding) or employ distributed concepts (TRL and multi-line 
TRL). Due to the objective difficulty, especially at higher frequencies, in 
manufacturing an accurate and predictable resistor in a commercial silicon 
technology, (multiline)-TRL calibration represents the standard employed 
technique for in situ calibration, as was shown in [Yau10; Yau12; Will3a; 
Will3b; Wil14]. The TRL technique does not require resistors to define the 
measurement normalization impedance, which is instead set by the charac- 
teristic impedance of the lines used during the calibration. Thus, the accurate 
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(frequency-dependent) determination of the calibration lines characteristic 
impedance becomes a key requirement to allow the correct re-normalization 
of TRL-calibrated S-parameter measurements. 


4.3.1 Characteristic Impedance Extraction 
of Transmission Lines 


To accurately employ TRL techniques in a complex environment such as 
the BEOL of Silicon-based technologies, a robust approach is required to 
extract the characteristic impedance of the transmission line. Traditional 
extraction procedures are based on measurements [Eis92; Mar91la; Wil91a; 
Wil91b; Mar91b; Wil98], but are only accurate when specific assumptions 
are verified, such as, low loss substrate, constant capacitance per unit length 
[Mar91b], and uniform [Eis92] non-inductive pad-to-line transitions [Wil98; 
Wil01]. For a more extensive analysis of the shortcomings of these methods 
for (sub)-mm-wave calibration in the BEOL of silicon technologies, the 
reader is invited to read [Gal17a], where a characterization flow employing 
3D EM simulations was developed and validated to accurately extract the 
Zo of transmission lines, excited using waveguide (modal) excitation. The 
scattering parameters computed during simulation are re-normalized to a 
given system value (i.e., Zsys = 50 Q) and used in Equation (4.1) to compute 
the line characteristic impedance [Eis92]: 


(1+ 5,) — 83, 
Ty Fight AAM 4.1 
ane (attest on 


The approach was validated by benchmarking it with the calibration compari- 
son method using a calibration kit integrated in the BEOL of the IHP SG13G2 
130 nm SiGe BiCMOS technology. For the purposes of a fair comparison, 
the lines were designed to be uniform, with no line-to-pad discontinuities to 
provide an accurate test case for the calibration comparison method [Wil01]. 
The calibration kit was designed to allocate different waveguide bands from 
75 GHz to 325 GHz. The micro-photographs of the WR-5 (140-220 GHz) 
structures are displayed in Figure 4.7(a—c), while Figure 4.7(d) shows the 
schematic line cross section. 

The structures were simulated using three different 3D electro-magnetic 
simulators, Keysight EMPro, Ansoft HFSS, and CST Studio Suite, to check 
for simulation discrepancies. In the model, the meshed ground planes have 
been simplified considering a continuous metal connection, both vertically 
and horizontally. This simplification provides good approximation of the 
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(a) 


Figure 4.7 Coplanar wave guide CPW calibration structures realized on IHP SiGe 130 nm 
BiCMOS technology. (a) Microphotograph of the thru line, (b) of the reflect standard and (c) 
of the transmission line employed for the WRO5 calibration kit, (d) schematic cross section of 
the CPW line. 


electrical response of the structure since the openings in the metal mesh 
are much smaller than the wavelength (maximum aperture is in the order 
of 2.5 x 2.5 um). The excitation of the CPW lines is provided by means 
of waveguide (modal) ports. The simulator first solves a two-dimensional 
eigenvalue problem to find the waveguide modes of this port and then matches 
the fields on the port to the propagation mode pattern, and computes the 
generalized (i.e., mode matched) scattering parameters. In all the simulators, 
the port dimensions are designed using the rules of thumb described in 
[Wei08], ensuring ideally no fields at port boundaries, as also depicted in 
Figure 4.8 for two simulator examples. 

Absorbing/radiation boundaries are then imposed at the lateral and top 
faces of the simulation box. The box is defined horizontally by the dimensions 
(length/width) of the simulated structure, and vertically by the wavelength 
(A/4 at minimum simulation frequency). The bottom face of the simulation 
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Figure 4.8 Field distribution on waveguide ports at 300 GHz when exciting the structures 
described in Figure 4.7, for (a) Keysight EMPro and (b) Ansoft HFSS. 


box is defined as a perfect electric conductor, simulating the presence of a 
metallic chuck underneath the structure, as it is the case during measure- 
ments. The absorbing boundaries simulate an unperturbed propagation of the 
EM waves through this boundary. In this respect, the interference with other 
structures on the wafer is not taken into account in the simulation. Mate- 
rial parameters and lateral dimension are chosen according to the nominal 
technology values. 

Figure 4.9 shows the comparison of the characteristic impedance com- 
puted using the proposed method (with different EM simulation tools), and 
the characteristic impedance extracted with measurements using the calibra- 
tion comparison method [Wil01] and the Eisenstadt method [Eis92]. Both 
the measurement based methods are hampered by the (small) discontinuity 
presented by the probe to line transition, as predicted in [Mar92] and [Wil01]. 
The EM-based method offers fairly constant (with frequency) characteristic 
impedance response, as expected. It is interesting to note how simulations 
performed employing different EM tool, produce slightly different values 
for the characteristic impedance (max. 1 Q for the real part and 0.1 Q for 
the imaginary part), when applying similar settings in terms of meshing 
and solving methods. The differences can be attributed to different meshing 
algorithms, discretization, etc., of the tools which could all be categorized 
as the intrinsic uncertainty of the proposed method. This comparison shows 
how the proposed method provides comparable results to the state-of-the- 
art techniques, when the validity of the latter is still guaranteed by the 
transmission line design. As the method of [Gal17a] is ideally valid for any 
kind of transmission line, it can be employed also in situations in which large 
inductive probe-to-line transitions are present, which is the case when vias 
are involved. 


174 (Sub)mm-wave Calibration 


— — HFSS Z0 (this paper) 
364| —— EMPro Z0 (this paper) 
341 7T CST ZO (this paper) 
m Method of [22] Ree 
32 O _Method of [17] ee 


Re(Z,) (9) 


50 100 150 200 250 300 350 
frequency (GHz) 


Figure 4.9 Real part of characteristic impedance for the line shown in Figure 4.7(a), 
computed with the simulation approach described in [Gal17a] (solid lines EMPro, dashed 
lines HFSS, dashed-dot lines CST), and measured using the method of [Eis92] (empty circles) 
and the method of [Wil91b] (filled squares). 


In order to compare the different calibrations, the method of [Wil92] has 
been employed, defining an upper bound (UB) error metric as: 


UB(f) = maz |S; (F) — Si3(f)| (4.2) 


Where S’ is the reference scattering matrix of the verification line (i.e., 3D 
simulated S-parameters), S(f) is the frequency-dependent scattering matrix 
resulting from the investigated calibrations (i.e., LRM on alumina, TRL on 
fused silica and TRL on BiCMOS) and 7,7 € [1,2]. This metric defines the 
UB of the deviation of the S-parameters measured by one calibration and the 
reference S-parameters computed using EM simulations. The measurement 
data used to compute the error bound of Figure 4.10 are based on the same 
raw data of the verification line, thus removing any measurement variation 
of the verification structure from the error propagation mechanisms. On 
these raw data the respective calibration algorithm (with their respectively 
computed error terms) were applied. In addition, both the methods indicated 
as TRL on silicon in Figure 4.10 use also the same raw measurement in 
the calibration procedure (i.e., extraction of error terms), thus confining 
their difference only to the characteristic impedance values versus frequency, 
computed with the two different methods. 
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Figure 4.10 Comparison of probe-tips corrected measurements of a verification line manu- 
factured on the SiGe BEOL in the frequency range 75-325 GHz for different calibrations. 


As can be seen from Figure 4.10, the calibration performed on SiGe tech- 
nology is the one that presents smaller deviation from the reference data, with 
an UB < 0.17 in the entire frequency band for both characteristic impedance 
extraction methods considered, i.e., the proposed EM-based method 
(Figure 4.10, asterisks) and the calibration comparison method (Figure 4.10, 
filled squares). 


4.4 Direct DUT-plane Calibration 


The method to derive the characteristic impedance of a transmission line 
described in the Section 4.3 “Direct On-wafer Calibration” will be applied 
to extract the Zo of transmission lines employed in a TRL calibration/ 
de-embedding kit to perform S-parameters measurements at the lowest metal 
layer (M1) for direct DUT access. Realizing transmission lines in the lowest 
metal layers can present several challenges, typically associated with the 
losses of the underlying substrate (i.e., conductive silicon). One solution was 
proposed in [Gal17b], where a CPW line realized at M1 was capacitively 
loaded with a series of floating metal bars (CL-ICPW), realized in a higher 
metal layer, in order to confine the propagating electromagnetic field in the 
low loss oxide of the BEOL. 
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Figure 4.11 Simplified schematic top-view of a generic test-structure realized with CL- 
ICPW. (a) Input section, (b) M7-M1 vertical transition and (c) DUT stage. 


This line topology can be employed in the TRL calibration/de-embedding 
kit, as depicted in Figure 4.11. The general structure of the fixture features 
three main sections: an input stage [pad plus launch line, section (a); a 
transition from top metal to M1, section (b), composed by all metal layers 
and interconnecting vias; and the final section (c), realized on M1 using 
CL-ICPWs that can feature a transmission line i.e., thru or line for the 
TRL de-embedding kit] or an offset short. The calibration/de-embedding kit, 
was manufactured using the BEOL of Jnfineon’s 130 nm SiGe BiCMOS 
technology B11HFC, featuring seven metal layers. Figure 4.12(a) shows a 
cross section of Figure 4.11, section (a), where M3 is used as ground shield 
in order to isolate the CPW from the lossy substrate. The transition from the 
top metal center conductor of the CPW to the M1 center conductor of the 
CL-ICPW is realized using a gradual, inverse pyramidal shape. This allows 
to connect the large top metal conductor (i.e., 30 ym width) with the smaller 
M1 line, keeping the ground reference at the same metal level (i.e., M3) as 
shown in Figure 4.12(b). 

For the DUT stage, M3 is chosen as the metal layer for the floating 
shield. This choice allows reducing the losses while guaranteeing a Zo of 
34 Q, sufficiently close to the 50 Q required to minimize the errors arising 
from reflection losses when measuring in a conventional VNA-based setup 
[Mub15]. The shield is realized with 2 zm wide metal strips and a fill factor 
of 50% in order to respect the density rules. The cross section of the final 
design for the CL-ICPW is shown in Figure 4.12(c). Micro-photographs 
of the calibration/de-embedding kit for WR-3 (220-325 GHz) waveguide 
bandwidth are shown in Figure 4.13. 
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Figure 4.12 Schematic cross section of the input stage used for the test structures (a). 3D 
model of the vertical transition connecting the central conductor of the input stage in M7 to the 
CL-ICPW central conductor in M1 (b). Schematic cross section of the CL-ICPW employed in 
the DUT stage of the calibration kit (c). 


The kit employs 130 um long launch lines. The thru standard is realized 
by means of a 150 um CL-ICPW, and it is designed to embed the final DUT 
(transistor here) in its center reference plane. The de-embedding kit reflects 
are realized by two symmetric offset shorts, with an offset equal to half the 
thru length. Further, a longer line with an additional 80 wm length for the 
CL-ICPW, in respect to the thru, is realized as the line standard. Finally, a 
test structure consisting of a 310 um long CL-ICPW has been realized for 
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Figure 4.13 Micrograph of the de-embedding kit on Infineon B11HFC technology. 


verification. EM simulations are then performed to extract the characteristic 
impedance of the line. Note that the only structures simulated are the CL- 
ICPW in Figure 4.12, section (c). For this purpose, the procedure described 
in Section 4.2 is employed. Once the characteristic impedance is extracted, 
the proposed kit can be employed for direct calibration at M1. To demon- 
strate the proposed calibration/de-embedding method in its final application, 
measurements of a heterojunction bipolar transistor (HBT) featuring two 
emitter fingers with 5 um length and 220 nm width were performed. The 
device was embedded into the test fixture employing CL-ICPW in common- 
emitter (CE) configuration directly at the calibration reference planes, shown 
in Figure 4.14. To guarantee proper connection between the CL-ICPW test 
structure and the transistor (BECEB) modeled in the process design kit 
(PDK) (i.e., employing a p-type guard ring around the active device, with 
ground contacts connected to metal level 1) a small bridge at metal 2 (see, 
Figure 4.14(b) was added. After calibration, EM simulations of these lines are 
used to de-embed them from the measurements. Note, that the configuration 
and interconnections (no M1 connections between the emitters and the bases) 
is only illustrative of the technique. When a different reference plane needs 
to be defined and different parasitic element included or excluded from the 
device model this can be achieved by properly setting the reference plane of 
the calibration through the proper design of the reflect standard and the zero 
length thru position. 
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(b) 
Figure 4.14 Top view of the transistor (BECEB) integrated into the test-structure 
(a) Detailed view of the layout for the integrated transistor (b), highlighting the input and 
output fixture (in yellow) required to guarantee connection to the intrinsic device. The base, 
collector, and emitter contact (B, C, and E, respectively) are marked on the layout. 


The device S-parameters have been measured using the direct calibration 
technique in the frequency range from 220 to 325 GHz, using fixed bias 
conditions ensuring close to peak fr, i.e., Vcg = 1.5 V and Vgr = 0.91 V. 
The measurement results are then compared with the S-parameters obtained 
by using the HICUM level 2 model of the device. The device selected 
in the layout of this work was not supported by a model in the PDK so 
that an approximate set of parameters had to be generated. Figure 4.15(a) 
shows the comparison of the magnitude in dB for all the S-parameters of 
the considered transistor. The measured values for Sj; and S21 agree quite 
well with the model prediction, with discrepancies in the order of 0.2 dB, 
while S22 shows a bigger error, with a maximum value in the order of 1.1 dB 
in the entire frequency range. The Sı parameter shows the biggest relative 
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Figure 4.15 S-parameter measurements (dotted lines) versus model of the considered 
Infineon transistor (solid lines) for both a) Amplitude (in dB) and b) Phase (in degrees). 


error in magnitude, due to its small absolute value. Discrepancies between 
measurements and model are more significant when considering the phase 
information (see, Figure 4.15(b)) where they can reach 40 degrees for S19. 
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4.5 Conclusion 


In this chapter various concepts and techniques to achieve accurate cali- 
bration techniques at (sub)mm-waves for device characterization have been 
reviewed. The problems of electrically thick substrates have been explained 
and experimentally validated. Electrically thin substrates with their per- 
formance improvement were discussed. The problems and possible error 
compensations related to substrate transfers are addressed. A complete flow 
and an EM-based technique to design and characterize TRL-based calibration 
kits to be embedded in the BEOL of commercial silicon technologies were 
described. Finally, an approach to realize direct calibration/de-embedding 
kits capable of measuring the device performance at M1 was presented and 
experimentally validated. 
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5.1 Mixed-mode Stress Tests 
5.1.1 Introduction to Hot-Carrier Degradation under MM Stress 


An important reliability concern in SiGe HBTs is related to long-term 
degradation (stress or aging) effects induced by hot carriers (HCs). While 
in MOSFETs HC mechanisms produce a degradation of drain current and 
transconductance, as well as a threshold voltage shift [Tyal5], in bipolar 
transistors the HC damage is mainly related to the creation of Si dangling 
bonds acting as trap states at the semiconductor—insulator interfaces. Interface 
traps induced during device operation lead to an increased Shockley—Read— 
Hall (SRH) recombination and hence to an excess non-ideal base current 
component. Differently from MOSFETs, the collector current remains unaf- 
fected, and therefore it can in principle be stated that HC degradation is less 
critical in bipolar transistors, including the SiGe HBT technology; however, 
it still entails a number of undesirable consequences, such as current gain 
reduction (due to the base current growth), noise figure increase, shift of 
the bias point outside of the functional range, as well as increased power 
consumption in power amplifiers [Ven00, Cre04, Che09]. Such effects have 
been traditionally studied under reverse base-emitter stress conditions, where 
HCs are created by large electric fields across the base-emitter junction 
(see the early papers [Bur88, Gog00] and the more recent [Sas14a, Sas14b, 
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Fis15]); this stress test was indeed considered as appropriate for assessing 
device reliability in BiCMOS operation [Bur88]. 

Another stress technique has been subsequently proposed and quickly 
accepted in the literature, which is more representative of device degrada- 
tion in practical mixed-signal and RF circuit applications; in this technique, 
referred to as mixed mode (MM) [Zha02], the device under test (DUT) is 
usually operated in common-base (CB) configuration while being simultane- 
ously subjected to large emitter current density (Jp stress) and collector—base 
voltage (VcB stress) [the corresponding Vcg being higher than the open- 
base breakdown voltage (BVcro)] [Zhu05, Dio08, Chal5]. Although this 
biasing condition may seem too severe, the instantaneous operating point 
of a transistor (e.g., in oscillators and in noise/power amplifiers) can reach 
either high voltage or high current under large-signal operating mode, thereby 
gradually increasing HC-triggered damage [Che09, Fis08, Gre09, Fis15]. The 
high — and continuously applied — stress conditions Jp stress and VCB stress are 
also denoted as accelerating factors, since they give rise to significant MM 
stress degradation in a relatively short time. Under MM stress tests, the trap 
creation process involves the following steps [Moe 12]: 


e the large electric field across the base—collector space-charge region 
(SCR) first creates primary HCs, and then additional (secondary, tertiary, 
and so on, depending upon Vp stress) HCs by impact ionization (ID); 

a fraction of the generated HCs can be directed toward the emitter—base 

oxide spacer (used to separate the Si emitter from the extrinsic base 

region) or the shallow trench (ST) oxide edge. Along these paths, they 
lose some energy due to collisions; 

e if the HCs reach the oxide interfaces with an energy higher than 1.5 eV, 
then damage is produced in the form of dissociation of passivated Si-H 
bonds [Tyal5, Tyal6]. The damage spectrum depends on the accel- 
erating factors JE stress and VCB stress for multiple reasons: first, they 
determine the II rate, but also the device temperature (high temperatures 
can have a beneficial impact in terms of damage recovery); moreover, 
they can trigger high-current (Kirk effect) or high-voltage (pinch-in 
effect) phenomena [Che07]. The trap creation at the emitter—base spacer 
(attributed to hot holes [Van06, Kam17b]) can be monitored by mea- 
suring the forward Vcp = 0 V Gummel plot, where an SRH-induced 
growth in the low-Vpg base current is observed; the reverse Vep = 0 V 
Gummel plot is instead used to measure the damage due to HCs hitting 
the ST interface, since it causes an increase in the reverse-mode base 
current [Zha02, Zhu05, Che07, Moe12, Chal 5]. 
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5.1.2 Long-term MM Stress Characterization on IHP Devices 


Mixed mode stress has been investigated in a recent exhaustive work [Fis15], 
where — differently from previous papers — long-term stress tests were per- 
formed, plainly showing a decrease in the degradation rate at long times. 
Moreover, an accurate investigation is presented, which includes the effect 
of: (i) the accelerating factors JE stress and VCB ‚stress, (ii) stress temper- 
ature, (iii) thermal recovery, and (iv) compact modeling of stress-induced 
base current components. The results of [Fis15] reported here refer to a 
packaged single-emitter SiGe:C NPN HBT fabricated by IHP, with effective 
emitter area! Ap = Wg x Lp = 0.16 pm x 0.52 um, Wg and Lp being 
the effective emitter width and length, respectively, featuring peak fy of 
250 GHz at Jc = 18 mA/um?, peak fmax equal to 300 GHz, BVcro = 1.7 V, 
BVcpo = 5 V, and mounted in a CB configuration.” 

The procedure can be described as follows. First, a Gummel plot is 
measured at Vcg = 0 V, the HBT being still fresh (i.e., stress-unaffected). 
Then the stress bias (high JF stress and VB stress) is applied, and the evolution 
of the collector and base currents with stress (aging) time are monitored by 
measuring non-stressing Vcg = 0 V Gummel plots at chosen time instants 
and recording their values at Vgg = 0.7 V. Figure 5.1 shows the CB output 
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Figure 5.1 CB output characteristics of the DUT manufactured by IHP; also shown are the 
pinch-in locus (red dashed line) and the stress paths A, B, C. 


'The effective emitter area is the area of the interface between Si emitter and SiGe base, 
which defines the vertical current flow. 

The analysis obviously requires the availability of a number of identical HBTs, one for 
each stress test to be performed. 
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characteristics of the DUT; also reported are the locus of pinch-in occurrence, 
which represents the limit of the CB safe operating area (SOA), and the stress 
conditions: in case A, identical transistors are biased with a low J stress 
(=0.12 mA/um?) and different Vcg stress) IN experiment B, other identical 
transistors are biased with a high JE stress = 12 mA/ um? (not far away from 
the current density at peak fr) and different VCOB stress; in case C, VCOB stress 
is kept constant at 2.75 V, and different Jp stress are applied to identical 
transistors. 

The first measurement campaign was conducted by forcing the transistor 
backside to a temperature Tg = 300 K through a thermochuck. In Figure 5.2, 
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Figure 5.2 Relative base current degradation of the IHP HBT(s) as a function of stress 
time (a) for JE stress = 0.12 mA/um? and various VCB,stress (series A), and (b) for 
Vop,stress = 2.75 V and various JE, stress (series C). Also shown is extraction of exponent 
a at short and long stress times for selected cases. 
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the relative base current degradation 100 - (IBstress — /Bfresh)//Bfresh due to 
the enhanced recombination is shown for cases A and C at different values of 
VCB,stress (A) and JR stress (C); it is found that the damage increases with 
stress time following a power law dependence (~t},,,,,). For short stress 
times (within a few hours), exponent a is around 0.5 for low/medium currents 
(consistent with [Che09]); on the other hand, it is found that the degradation 
rate decreases for longer stress times, where œ approaches about 0.2. This 
is important in terms of device lifetime prediction (e.g., [Pan06]): if data are 
extrapolated from short-time experiments, the effect of degradation within a, 
e.g., 10-year timeframe would be largely overestimated. It must be remarked 
that the measured damage evolution does not indicate a trend to saturation 
within a 1,000-h-long stress time. From Figure 5.2(b) it can also be noted that 
the damage first increases with stress current, then reaches a maximum and 
declines at high currents (24 mA/um?). This “hump” behavior has also been 
observed in [Che07] and can be attributed to the higher temperature induced 
by self-heating (SH): when the temperature exceeds ~350 K, damage indeed 
reduces as a result of (i) enhanced trap passivation, which starts dominating 
over trap creation, and (ii) increased carrier scattering, which reduces the 
number of highly energetic carriers reaching the interface. Moreover, the 
time exponent a is seen to lower at high currents [Fis13, Fis16]. Although 
not reported in the figures, this SH-induced reduction in degradation is also 
observed at high Vcp stress for case B. 

In order to investigate trap passivation occurring at high temperature, 
thermal annealing experiments were carried out. A high temperature of 
the base-emitter junction Tj (about 543 K)? was reached by increasing 
Tg to 398 K and raising the dissipated power (Pp) via the application 
of VoB anneal = 1.5 V and JE anneal = 30 mA/um?. Figure 5.3 illustrates 
the relative base current reduction 100 - (IBanneal — /Bstress)/IBstress against 
anneal time for the previously stressed DUTs (series A in Figure 5.1). The 
main findings are: (i) the thermal annealing is more effective in transistors 
that underwent a heavier stress and (ii) significant current gain recovery is 
observed, independently of the stress load. 


5.1.3 Medium-term MM Stress Characterization on IFX Devices 


On-wafer medium-term MM stress tests were performed at Univer- 
sity of Naples on single-emitter SiGe:C NPN BEC HBTs manufactured 


>This value was assessed by a preliminary extraction of the thermal resistance. 
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Figure 5.3 Relative base current reduction vs. anneal time obtained by applying Tg =398 K, 
VoB,anneal = 1.5 V, and JE anneal = 30 mA/pm? to IHP HBTs previously stressed with the 
bias conditions of Figure 5.2(a) (also reported in the legend). The monitoring of the base 
current was performed in situ, i.e., at Tg = 398 K for Vses = 0.6 V and Vcg =0 V. 


by Infineon Technologies (hereinafter denoted as IFX) [Chev11]. The 
experiments were conducted on a device with effective emitter area* 
Ap = Wg x Lp = 0.13 x 2.73 pm?, exhibiting a peak fr of 240 GHz, a 
peak fmax of 380 GHz at Veg = 0.5 V, BVcro = 1.5 V, BVcBo = 5.5 Vv 
and mounted in a CB configuration. Similar to the procedure in [Fis15], the 
stress experiments were conducted by applying high Jk stress and Vop stress 
to the DUT, and monitoring the collector and base currents as a function 
of stress time through measurements of forward Vcg = 0 V Gummel plots 
at chosen stress times. A first investigation was carried out by consider- 
ing different values of JE stress (namely, 1.4, 7, and 14 mA/\m?) for the 
same VCB,stress = 2 V. The relative base current degradation evaluated for 
Ver = 0.7 V in the Vcp = 0 V Gummel plots is illustrated in Figure 5.4; it 
can be observed that (i) after 104 s the damage is still confined below 100% 
for all cases due to the low VCB stress applied, and (ii) the highest JE stress 
leads to a reduced degradation induced by the high temperature, as will be 
discussed in the following. 

Another analysis was conducted by applying the lowest Jr stress 
(=1.4 mA/j.m?) and three Vop,stress Values, namely, 2, 2.5, and 2.75 V. 


“Further details on the technological definition of effective emitter area for IFX HBTs can 
be found in [dAI14]. 

>The transistor belongs to set #3 defined in 5.4.1; again, various identical devices were 
available, one for each stress test. 
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Figure 5.4 Relative base current degradation of the IFX device(s) vs. stress time for 
VCB stress = 2 V and various JE, stress- 


Figure 5.5, showing the relative base current degradation at Vpp = 0.7 V, 
evidences that the damage increases with VCB stress due to the higher electric 
field in the base—collector depletion region, which in turn gives rise to a 
higher number of HCs with an energy higher than 1.5 eV impacting on the 
interface of the emitter—base oxide spacer and thus creating traps. It is found 
that the damage exceeds 100% for long times (104 s) for VoB,stress = 2.5 V 
and even for very short times (300 s) for Vop stress = 2.75 V. Consistently 
with other works, Figure 5.6 witnesses that a power law (t&ress) well describes 
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Figure 5.5 Relative base current degradation of the IFX DUT(s) against stress time for 
Ji stress = 1.4 mA/um? and various VCB stress- 


192 Reliability 


= 
e} 
w 


ii a ny 
=1.4 mA/um? 
=2.5V 


J E,stress 


V 


CB,stress 


102 


10! 


0° po ooa oa aal o aaa ooa 
100 101 102 10° 104 105 
Stress time [s] 


Relative base current degradation [%] 


108 


ST 


<3 2 
JE stress=! “4 mA/um 


Vos ,stress 72-75 V nre 


102 


Relative base current degradation [%] 


109 50 1 2 3 4 5 
10 10 10 10 10 10 
Stress time [s] 
Figure 5.6 Relative base current degradation of the IFX device(s) as a function of stress time 
for Ji, stress = 1.4 mA/wm? and (a) Vop,stress = 2.5 V, (b) VoB,stress = 2.75 V. Also shown is 
the extraction of exponent a at short and medium times. 


the evolution of the base current degradation, provided that a different œ is 
considered for short (high a) and medium (low a) stress times. 

Following the approach presented in [Van06], an analysis was carried 
out to gain an in-depth insight into the device behavior under MM stress 
conditions. In particular, a fresh DUT identical to the stressed ones was mea- 
sured by sweeping Vcp for various assigned Jgs. After a straightforward data 
processing based on the technique in [Lu89, Zan93] and on the knowledge 
of the thermal resistance Rpy = 7,000 K/W (determined according to the 
method in the section “Experimental Rpy Extraction”), it was possible to 
obtain the Jay — Jre (Jav being the avalanche current density) and Tj — Jg 
curves shown in Figure 5.7. It can be inferred that at low Jg the avalanche 
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Figure 5.7 (a) Avalanche current density Jay and (b) base-emitter junction temperature Tj 
as a function of emitter current density Jg for various Vcgs. Also shown are the conditions 
corresponding to the stress tests reported in Figures 5.4 and 5.5 (the same symbols were used 
for the sake of clarity). 


current Jay grows with Je as a result of the increased II related to the 
higher number of electrons traveling to the base—collector SCR; conversely, 
Jav decreases at high Jg due to the concurrent mitigating impact of high- 
injection (HI) and SH effects. Also identified in Figure 5.7 are the Jays and 
Tjs corresponding to the MM stress conditions related to Figures 5.4 and 5.5. 
The main findings are in agreement with the conclusions in [Van06] and can 
be summarized as follows: 


e although the stress test with applied JE stress = 14 mA/\1m? (star) shares 
the same Jay (0.3 mA/umĉ) and Vcg (=2 V) as the test with 
JE stress = 7 mA/um? (rhombus), in the first case the damage is lower 
due to the higher device temperature (400 K instead of 300 K, as shown 
in Figure 5.7(b)); 

e conversely, the tests carried out at the same current density 
Je stress = 1.4 mA/ wm? and different Vops (square, circle, and triangle) 
share similar temperatures and different Jays (the avalanche current 
increases with Vcg due to the higher electric field in the base—-collector 
SCR). As a result, the damage grows with increasing Vcg. 


5.2 Long-term Stress Tests 


The improved frequency performances of state-of-the-art SiGe HBTs have 
been achieved at the cost of significantly increased operating current densities 
and lower breakdown voltages [Sch17]. Thus, devices are often operated 
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closer and even beyond the border of the classical SOA; however, this 
can limit stable device operation due to reliability issues induced by the 
previously discussed HC degradation. In the following, some dedicated long- 
term stress tests are carried out to clearly identify the influence of biasing 
conditions along the SOA limit on the device reliability [Jac15]. 


5.2.1 Experimental Setup 


The stress tests were conducted on devices biased in a common-emitter (CE) 
configuration under bias conditions close to the SOA border. Since these 
conditions are not accelerating like in conventional MM tests, a long stress 
time (up to 1,000 h) was required to observe an impact on the electrical 
characteristics. The transistors are single-emitter SiGe:C NPN CBEBC HBTs 
fabricated by IFX, with an effective emitter area of Ap = 0.13 x 9.93 um? 
featuring a peak fr/fmax equal to 240/380 GHz and a BVcro/BVcpo of 
1.5/5.5 V [Chev11, Böc15]. In order to observe and record the evolution 
of the base and collector currents during the tests, non-stressing forward (at 
Vcp =0 V) and reverse (at Vep =0 V) Gummel plots were measured at fixed 
time instants during the 1,000 h-long experiments [Jac15]. Four stress bias 
conditions, referred to as P1, P2, P3, and P23, were applied at Tp = 300 K 
along the SOA boundary, as shown in Figure 5.8. The corresponding 
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Figure 5.8 Output characteristics of the SiGe HBT under test simulated using HICUM/L2. 
Also represented are the examined bias conditions (P1, P2, P3, and P23). 
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voltage (Vcp), collector current (Ic), and current density (Jc), as well as the 
(average) temperature rise AT; over the base-emitter junction (obtained from 
the thermal resistance Rpy = 2,850 K/W determined with the approach in 
[dAl14]) are summarized in Table 5.1. It must be remarked that P1 is defined 
below BVcgo, whereas P2, P3, and P23 are beyond BVcgo. 


5.2.2 Long-term Degradation Test Results 


For P1, P2, and P3, the tests were performed on six HBTs for each bias 
condition, (that is, 18 identical HBTs were measured). Figure 5.9 shows the 
forward Vcg = 0 V Gummel plots during the stress test at P1, P2, and P3, 
while Figure 5.10 illustrates the aging-induced Ig growth at Vgg = 0.713 V. 
The following considerations are in order. 


e At P3, Ig increases regularly with stress time for low Vp; at 
VpeE = 0.713 V, the variation is 120 nA after 1,000 h. 

e At P2, Ig slightly increases for low Vgg; at Vgg = 0.713 V, the variation 
amounts to 80 nA after 1,000 h. 

e At P1, no sizable degradation is monitored. 


In conclusion, the higher Vcg, the more the low-injection Ig increases with 
stress time. 

It must be remarked that a slight natural recovery is observed if the device 
stays on the shelf between two stress periods. This recovery is visible at 300 h 
in Figure 5.10 for the biasing conditions P2 and P3 (the devices were left 
unstressed for 24 h before the measurement of the Gummel plot). 

Concerning P23, the forward Vcg = 0 V Gummel plots for each stress 
time are shown in Figure 5.11, which indicates that Ig increases at low VBg, 
while remaining almost constant at high Vgg. The relative variation of the 
base current at Vgg = 0.65 V is shown in the inset, along with the variation 
measured at P2 for an identical HBT. The comparison highlights that Ig 
exhibits similar evolutions at P23 and P2 due to the same collector—emitter 
voltage (VcE = 2 V), in spite of the four times higher Jc at P3. 


Table 5.1 Stress bias conditions and corresponding junction temperatures 


PI P2 P3 P23 

Vcr [V] 1 (<BVceo) 2(>BVcreo) 3(>BVceo) 2(>BVcero) 
Ic [mA] 12.9 6.45 1.29 32.27 

Jc [mA/um?] 10 5 1 25 


AT; [K] 37 37 11 184 
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Figure 5.10 Evolution of the excess base current as a function of stress time for six identical 
HBTs tested at P1, six HBTs tested at P2, and six HBTs tested at P3. 
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Figure 5.11 Evolution of the forward Gummel plot with stress (aging) time at P23. Shown 
in the inset is the relative variation of Ig at Vgm = 0.65 V. 


The reverse Vpp = 0 V Gummel plots were also measured for all biasing 
conditions at chosen time instants during the stress tests. Since the DUTs 
have the emitter and substrate connected to the ground pad, the base voltage 
was fixed to 0 V to obtain Vpp = 0 V, and a negative collector voltage was 
swept from 0 V to —1 V to increase Vgc. Due to the forward biasing of 
the substrate—collector junction, a substrate current is added to the emitter 
current. It was found that the sum of the emitter and substrate currents 
remains almost constant regardless of the bias point, as can be inferred for 
the P23 case in Figure 5.12. Different behaviors were instead observed for the 
Ig — Vgc curves: Figure 5.12 also witnesses that at P23 Ig tends to rapidly 
increase during the first few hours for Vgc < 0.6 V, eventually saturating 
after 48 h (as shown in the inset), while remaining constant for Vgc > 0.6 V. 
No significant degradation of the reverse Gummel plot was instead observed 
at P2 (despite the same Vopr as P23) and P3 [Jac15]. It is also worth noting 
that the distortion measured in the reverse Veep = 0 V Jp — Vgc plot for 
low Vgc at P23 resembles that of the forward Vcg = 0 V Ig — Vege plot at 
low VBE. 

The forward Ip increase observed at low Vpr at P2, P3, and P23 
has been attributed to hot holes generated by II (Vce > BVcgo) at the 
base—collector SCR and then driven by the electric field to cross the base and 
hit the edge of the spacer with enough energy to create traps, as determined in 
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Figure 5.12 Evolution of the reverse Gummel plot with stress (aging) time at P23. Shown 
in the inset is the relative variation of Jp at Vgc = 0.5 V. 


[Kam17b] with an advanced TCAD simulation strategy based on the solution 
of the Boltzmann Transport Equations for electrons and holes through the 
Spherical Harmonic Expansion approach (see Chapter 2). As clarified in the 
section “Introduction to hot-carrier degradation under MM Stress,” the traps 
in turn lead to a non-ideal Jp growth via trap-assisted SRH recombination. 
The higher damage occurring at P3 is due to the higher electric field within 
the base—collector SCR, which implies a higher concentration of hot holes 
with enough energy to break the passivated Si-H bonds [Kam17b]. 

On the other hand, the damage at the ST-Si interface (witnessed by the 
distorted Ig — Vgc plots) at the high-current P23 condition may be associated 
with both hot holes and hot electrons induced by II, as suggested in [Moe12]. 

The physical locations of the defects are illustrated in Figure 5.13 with 
the help of a cross section obtained from a TCAD simulation of the device 
structure. It is shown that the upper region of the ST-Si interface is more 
affected by hot holes, whereas the lower region is more affected by hot 
electrons. 

Another numerical analysis was performed using Sentaurus TCAD by 
Synopsys [Syn] to obtain a deep insight into the degradation mechanism; 
the hydrodynamic model with optimized parameters reported in [Sas10] 
was activated. More specifically, simulations were performed to extract the 
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Figure 5.13 Physical origin of the base current degradation represented in a cross section 
within a TCAD environment. 


evolution of the trap density N; at the spacer interface (P2, P3, and P23) and 
ST-Si interface (only P23) as follows: for each time instant at which the non- 
stressing forward and reverse Gummel plots were measured and recorded, 
N, (assumed to be at an assigned energy level E, such as E, — Ey = 0.6 eV, 
Evy being the valence band limit) was optimized so as to align the simulated 
plots with the experimental ones. Figure 5.14 reports the matching between 
measured and computed forward Vcg =0 V Gummel plots at P3 after 7 and 
750 h. Figure 5.15 illustrates the extracted N, at the spacer interface as a 
function of stress time for P2 and P3 [Jac15]. 
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Figure 5.14 Forward Gummel plots at Vcs = 0 V at P3 after (a) 7 h and (b) 750 h of stress. 
Measurement results (symbols) are compared with the simulated (solid) counterparts. 
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Figure 5.15 Trap density evolution along the interface of the emitter—base oxide spacer vs. 
stress time at P2 and P3. 


5.2.3 Low-frequency Noise Characterization 


Noise characterization can be considered as a diagnostic tool for analyzing 
quality and reliability of bipolar transistors [Vand94, Moh00]. Flicker noise 
is ubiquitous in almost every electronic device, although its origin is still 
deep in dispute. Unlike silicon BJTs, current HBTs are often affected by sig- 
nificant generation-recombination (G-R) noise (not so common in large-area 
devices) at low frequencies, mostly originated in the device external surface 
and periphery [Cos92, Tut95]. These noise sources may lead to presence of 
significant random telegraph signal (RTS) noise that can be observed in the 
time-domain noise signal. 

Hereinafter, a comprehensive analysis of the RTS noise in IFX SiGe:C 
HBTs is presented, in which dominant G-R mechanisms are evidenced at low 
bias currents in smaller geometries, as confirmed by RTS noise measurements 
[Muk17]. In larger geometries the RTS noise is not so frequently observed. 
Eventually, extractions of RTS time constants and their evolution with bias 
are analyzed to get insight into the active G-R mechanisms. The weak evo- 
lution of the RTS noise amplitude with bias indicates that the noise sources 
are located in the base-emitter peripheral region. Consequently, distinct RTS 
is observed at the collector side that is activated at high current regimes. 
This indicates the activation of traps located in trench areas due to fixed 
imperfections. 
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The noise characterization setup includes a Keysight E5270B semicon- 
ductor parameter analyzer for DC biasing, an HP 35670A dynamic signal 
analyzer for the measurement of voltage noise spectral density, and a Femto 
DLPVA-100-F-S low-noise voltage amplifier, which has a variable gain up 
to 100 dB with a bandwidth of 100 kHz and an 1 TQ input impedance. 
The measurements were performed at a gain of 40 dB. The entire on-wafer 
measurement system is connected through a GPIB interface and is controlled 
via the ICCAP software. The noise spectral densities of the transistors 
are measured in V?/Hz (averaged over 20 spectra). The time-domain RTS 
noise voltage was measured using the dynamic signal analyzer. The system 
noise floor was determined to be 2 x 10717 V?/Hz. During the biasing, a 
very high source resistance (Rg) was considered due to the current source 
for the base biasing, and a 50 Q resistance was used as load resistance 
(RL). The values of transistor parameters 6, rx, Rg, and Rp were extracted 
from DC measurements. The RTS noise was measured on single-emitter 
SiGe:C NPN HBTs fabricated by IFX (see the sections “Medium-term MM 
Stress Characterization on IFX Device” and “Experimental Setup”) with 
various terminal configurations and effective emitter areas, as summarized in 
Table 5.2. In order to eliminate process variation, the noise was measured on 
several devices (five to eight) of the same geometry from different dies. 

The forward Gummel plot for HBT #3 is shown in Figure 5.16, which 
depicts the bias range of the noise measurements. 

Figure 5.17 shows the base voltage noise spectral density (Sy) for 
transistor #1 at Vgg = 0.7 V and Vcr = 1 V. It can be clearly observed 
that two distinct G-R mechanisms (referred to as GR1 and GR2) are active. 
For all the investigated geometries, high G-R noise was observed in the 
low-frequency noise spectra at lower bias. These G-R noise mechanisms 
were particularly visible in smaller geometries. The corresponding RTS are 
shown in the two insets. Interestingly, it is found that one RTS (GR2) 


Table 5.2 Details of the DUTs for RTS noise measurements 
N° Configuration Apg(€Wg x Lg) [um?] 


1 BEC 0.13 x 2.71 
2 BEBC 0.13 x 4.91 
3 CBEBC 0.11 x 9.93 
4 BEBC 0.17 x 9.91 
5 BEBC 0.25 x 9.91 
6 BEBC 0.61 x 9.91 
vA BEBC 1.61 x 9.91 
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Figure 5.16 Forward Gummel plot showing the bias range for noise measurements. 
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Figure 5.17 Sy, showing different G-R mechanisms and their corresponding RTS in time 
domain for transistor #1. 


is superimposed on the other (GR1). The measured Sy, shows GR1 at a 
frequency of 8 Hz while the corresponding RTS measurement reveals an 
average time constant <T> of 16.2 ms, equivalent to a G-R cutoff frequency 
(fc) of 9.8 Hz (since 7 = 1/2afc). This confirms the existence of GR1. 
The other RTS (GR2) can be observed in the 0-100 ms range of the time- 
domain signal superimposed on the principal RTS (GR1) having a smaller 
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time constant of 2 ms (fc = 81 Hz) that corresponds to the GR2 at 78 Hz. 
In [Pas04], the observations are quite similar for SiGe HBTs, where a G- 
R is observed only at low bias and in a frequency range below 100 Hz for 
smaller devices; it was illustrated from time analysis that this G-R component 
is related to RTS noise. In our results, the bias dependence shows a weak 
evolution of the RTS amplitudes. This indicates that such a G-R mechanism 
is not located in SCRs, and possibly originates at base-emitter periphery 
[vHa02]. Similar G-Rs were observed in base current noise spectra in earlier 
stages of this work [Muk16a, Muk16b]. 

Figures 5.18 and 5.19 illustrate the Syg and the corresponding RTS at 
different bias conditions for the smallest (transistor #1) and largest (#7) 
geometries, respectively. Significant G-R contributions are clearly observed 
with large RTS time constants in #1, whereas #7 does not show significant 
G-R at low frequencies. For example, at Vgg = 0.725 V, transistor #1 exhibits 
significant GR1 (at 10 Hz) and GR2 components (around 40 Hz), which 
correspond to time constants of 15 ms and 3 ms (superimposed RTS) in 
the RTS spectra, respectively. As the bias increases, the RTS time constants 
become smaller, indicating a faster response from the traps, and at higher bias, 
such as 0.9 V, the G-R mechanisms completely disappear leading to absence 
of any RTS in the time-domain noise response. Transistor #7 does not show 
dominant G-R contribution: a minor G-R contribution can be seen around 
105 Hz (time constant of 1.5 ms) that is observed in the RTS at Vpp = 0.7 V. 

Evidently, the existence of significant RTS noise in smaller geometries is 
well accepted [Pas04]. In our case, the RTS corresponds to the existence of 
GR1 in the noise spectral density at low frequency, which we have identified 
as a contribution due to emitter periphery. 

Figure 5.20(a) shows the base RTS noise response of transistor #4 at dif- 
ferent bias conditions. Figure 5.20(b) witnesses that the RTS time constants 
at both the low (t1) and high (ty) states scale with 1/exp(qVpp/kT), except 
for the highest Vgg where the devices are entering the medium/high injection 
regime. The capture rate of carriers inversely depends on the available carrier 
density in the trap position, which can increase with bias. However, a small 
electric field decrease in the SCR due to a higher Vpr is not sufficient to turn 
into such a rapid roll-off in the characteristic time [vHa02]. This indicates that 
there must be an additional bias dependence on the trapping and de-trapping 
mechanisms. In [vHa02], the authors explained their results by tunneling of 
electrons from the neutral regions across the SCR into traps located in the 
spacer oxide near the periphery. As Vpp increases, the tunneling distance 
decreases due to reduced SCR width, resulting into faster trap response and 
therefore reduced RTS time constants (Figure 5.20(b)). 
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Figure 5.18 Sv, showing different G-R mechanisms at different bias (Vgs) conditions and 
their corresponding RTS in time domain for transistor #1. 
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Figure 5.19 Sv, showing different G-R mechanisms at different bias (Vgg) conditions and 
their corresponding RTS in time domain for transistor #7. 


5.2 Long-term Stress Tests 207 


ol (a) Transistor #4 (0.17 x 9.91 pm’) ig as 
a | I" boii i [mmr na Meson attr eng Pul 
zt | 
nal ú “ L 

2 Vis =0.725V 

ooh y: a 
al mL LLM mn NTL 


Tae nnn Ea 


il Væ = 0.8 V 


AV pe (AV) 
= 


Va" "W 
Zo 
A | 
2 | 
-10 
10 4 
Vp = 0.85 V 
5 
ea 
c 
-5 M 
bis 
< ao 
15) 
400 500 600 900 1000 
Time (ms) 


~Vexp(qV,/AT) E 


(b) 


0.65 0.70 0.75 0.80 0.85 0.90 


Vion Y) 


Figure 5.20 (a) Base RTS at different bias conditions, (b) corresponding time constants for 
the low and the high states as a function of bias (Vse) for transistor #4. 
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Figure 5.21 shows the RTS noise current amplitude (A/Jp) of the base 
noise for different geometries. A very weak bias current dependence (~/3°"') 
is found for smaller transistors, and an almost insignificant dependence is 
observed for larger geometries. This further corroborates that the RTS noise 
sources at the base side are located in the emitter—base periphery regions. 
Also in [vHa02] it was stated that when AJ/p scales with non-ideal base 
current component, these fluctuations often originate from noise sources in 
the spacer oxide at the emitter periphery. In our HBTs a large dispersion in 
AIp/Ip ratios was observed (between 0.3% and 3%) from device to device. 
In larger devices the A/p//p ratio has a relatively higher magnitude, yet 
this ratio inversely scales with Ip in all geometries. Different A/p/Ig ratios 
indicate slightly different physical origins of traps in different geometries. 

The bias dependence of the collector RTS is presented in Figure 5.22(a), 
which shows the RTS at different Vcps for transistor #5. Figure 5.22(b) 
illustrates the extracted trap time constants as a function of the collector—base 
bias for two geometries (#4 and #5) at Vgg = 0.9 and 0.8 V, respectively. It is 
expected that the time constants of high and low states are higher in larger 
geometries at lower bias since the tunneling distance is higher. However, 
the characteristic times remain almost constant for higher Vcpgs in both 
cases, and the steady-state value is reached faster in the case Vpr = 0.9 V 
(onset of high-current effects), whereas a sharp transition at lower Vcg is 
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Figure 5.21 Base noise RTS amplitude (A/g) as a function of bias (Ig) for different 
transistor geometries. 
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observed at Vgg = 0.8 V. The saturation of trap response at higher Vcg 
indicates that these RTS noise sources are possibly located at the top of 
the ST walls, and even if the base—collector SCR enlarges, the trap density 
at the trench sidewalls remains fixed. At Vgg = 0.8 V, a sharp transition is 
observed when the SCR is narrow (Vcg ~0.1 V), and with the SCR spreading 
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the tunneling time constants change due to activation of new traps until it 
reaches saturation. Conversely, due to high-injection (Vgg ~0.9 V) and base 
pushout effects, tunneling times remain constant since all the sidewall traps 
are already aligned within the SCR area. 

In conclusion, this comprehensive analysis of the RTS noise in advanced 
SiGe:C HBTs demonstrates the evidence of dominant G-R mechanisms at 
low bias currents in smaller geometries, whereas in larger geometries the 
RTS noise is not observed. The bias dependence of the RTS reveals a weak 
evolution in the noise amplitude indicating that the noise sources are located 
in the base-emitter peripheral region. Distinct RTS is observed at the col- 
lector side also near high current regimes, which was attributed to activation 
of traps located in ST walls. This highlights the applicability of RTS noise 
characterization to probe these imperfections via non-destructive means. 


5.3 Compact Modeling of Hot-Carrier Degradation 


The technology selection for the fabrication of integrated circuits is mainly 
driven by both performance and reliability criteria; in particular, the lifetime 
is one of the most crucial factors. To evaluate lifetime, the aging behavior of 
a specific degradation effect can be studied with TCAD simulations focusing 
mostly on bias-temperature instability and HC injection in MOSFETs, and 
on MM degradation in bipolar transistors. TCAD simulations can provide 
some in-depth information on the physical mechanisms. However, these 
simulations are done at the expense of very long simulation times, thus 
making them unviable for circuit design. Hence, it is necessary to develop 
a more practical circuit simulation platform using electrical compact models 
at transistor level suited to efficiently capture the physics of the degradation 
through aging laws and subsequently reflect it at circuit level. 


5.3.1 Empirical Equations by IHP 


Based on the long-term stress results discussed in the section “Long-term 
Degradation Test Results,’ IHP developed the following empirical equation 
for the base current degradation: 


Alp = CMM j f (VCOB stress, JE,stress) $ (cE is ie (5.1) 


where tstress is the stress (aging) time and q is a time-dependent power factor 
[Fis15, Fis16]. By referring to the early stress stage (t < 0.1 h), this general 
dependence on the stress conditions was extracted [Fis15]: 
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1mA/um? JEstre ie 
f (VCB stress, JE stress) = exp (poVonses}:( / + a 


JE, stress JEhc 
(5.2) 
where po = 1.5 V~! and Jebe = 25 mA/um?. 
The long-term development of the logarithmic aging rate 
Qog = d (log ATp) /d (log tstress) (5.3) 


can be approximately fitted to the observed base current degradation by 
means of the power coefficient 


0.3 
(tstress an Jn) 


a (toires JEn) = 0.2 + (5.4) 


0.1870: 


with pre-factor cyp = In. and JEn = JE, stress/J Emin, 1.€., the stress current 
density normalized to the minimum applied density Jgmin = 0.12 mA/jm?. 
This equation is rather complex because J; stress has enormous influence on 
the development of aging rate, as can be seen in Figure 5.23, where aging over 
a range of stress currents and up to 1,000 h has been successfully simulated 
by modifying the recombination current of an HBT compact model with the 
above equations. 


2 
“e (b) 8 
10° _ ety g 

— 4 CB,stress ` 3.25 V igi CB,stress Š 

L 10 E,stress = 0.12 mA/ym? E 

ae 10° © 

= ‘ © 

2 10 S 

š © 

2 10” è 

: 1s) 

or : p 5 

5 Y E,stress 10 Q 

2 ə ; a 0.12 mA/um? 3 

® 10 

© : 

a 10” g 

O 


0 


1 
06 07 0.8 0.9 10°10" 10° 10' 10° 10° 10° 10° 
Viel] Stress Time [h] 


Figure 5.23 (a) Forward Gummel plots and (b) base current degradation of IHP HBTs 
simulated with an empirical aging function (dashed lines). 
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5.3.2 HICUM-based Model 


Various compact models have been developed for mm-wave circuit applica- 
tions. One of the most commonly used model for SiGe:C HBTs is referred 
to as HIgh CUrrent Model (HICUM) [Sch05, Sch10, Sch13], and is based 
on the General Integral Charge-Control Relation (GICCR) [Sch93]. The 
GICCR allows taking into account the relevant transport mechanisms through 
a physical-based approach; this is the reason why HICUM was chosen to 
implement the aging laws based on the experimental results presented in the 
section “Long-term Degradation Test Results.” 

As described before, the degradation (i.e., the base current growth at low 
Vee) is due to an HC-induced increase in trap density over the interface 
of the emitter-base oxide spacer. In HICUM the base current in forward 
mode is subdivided into various components [Sch10], namely, the current 
Ijgri injected into the intrinsic part of the emitter as a main component, 
and the peripheral current, in turn composed of the back-injection current 
across the emitter perimeter junction ljBgẸp and the recombination current in 
the perimeter base-emitter SCR /REp; /jBEp and Rgp are given by: 


— ) | 
I =f - [exp | ——— ] - 1 5.5 
jBEp = /BEps | p (e. (5.5) 
VBEjp ) | 
Í =I - [exp | ———=— ] - 1 5.6 
REp = IREpS | p (e (5.6) 


where Vppip is the peripheral internal (junction) base—emitter voltage, while 
the saturation currents IBgps and /REps, as well as the non-ideality factors 
MBEp and MREp, are model parameters. In [Gho10, Gho1 1], it was shown that 
a possible approach to simulate the Jp degradation in InP HBTs is to use IREp 
given by Equation (5.6). More specifically, the Reps evolution is expected 
to follow the N, evolution VS. fstress (extracted as, e.g., in the section “Long- 
term Degradation Test Results”). In a simplified approach in which IRgps is 
assumed to saturate for long stress times, the dependence of IRgp on time can 
be accounted for in HICUM through the following differential equation for 
TREps: 


ORES L ATSF .(G—R- Irrps) (5.7) 


where G is the generation rate and R (fitting parameter) the annihilation rate 
of traps, while ATSF is an Aging Time Scale Factor added to shorten the 
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Figure 5.24 New transistor circuit used for aging law implementation in HICUM. 


simulation time needed to have a perceptible stress effect to minutes (instead 
of tens of hours). The generation rate G depends on the bias conditions; in 
particular, it is an increasing function of the collector—base voltage (Vcp) 
due to the enhanced II current Jay (accounted for in HICUM [Sch10]). 
The following linear relation was proposed in [Jac15] to include this 
dependence: 

G = A- Iaw + Go (5.8) 


A and Go being fitting parameters. Equation (5.7) with (5.8) was implemented 
in the Verilog-A code of HICUM/L2 by including the additional circuit shown 
in Figure 5.24. 


5.4 Thermal Effects 


Thermal issues have become a serious concern in SiGe HBTs due to the 
concurrent impact of the following factors: (i) the shrinking of the intrinsic 
device has induced a growth in power density within the base—collector SCR 
for a given bias condition; (ii) the trench isolation — exploited to reduce 
parasitics, crosstalk, and increase fmax — limits the heat spreading since 
trenches are filled with materials suffering from low thermal conductivity 
[Rie05, dA110, Youl1, Pet15]. This mechanism is even exacerbated by lateral 
scaling, which results in a horizontal reduction of the Si volume embraced 
by trenches; (iii) HBTs are operated at high current densities to boost the 
frequency performance, which entails a further increase in dissipated power 
density [Cre13]. Owing to these considerations, thermal effects can be viewed 
as an undesired, yet unavoidable, by-product of the technology evolution. 
Unfortunately, the enhanced heat generation (for a given dissipated power) 
and the reduction in heat removal have pushed the thermal resistances (Rp) 
of SiGe HBTs into the thousands of K/W [EIR12, Has12, Sah12] and even 
beyond 104 K/W for small emitter windows, as evidenced by recent experi- 
mental campaigns conducted on transistors fabricated by STMicroelectronics 
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(hereinafter referred to as STM) [dAI10] and IFX [dA114]. Thermal effects 
can lead to a severe distortion of the DC device characteristics (e.g., [LaS09]), 
and also degrade the low-frequency and high-frequency (since the DC bias 
is altered) behavior; besides the performance penalty, they may also affect 
the long-term reliability, and even trigger destructive instability phenomena. 
Consequently, care must be taken in assessing the impact of the thermal 
behavior in advanced technology nodes. 


5.4.1 Experimental Ry, Extraction 


Since the steady-state thermal behavior of a device is fully described by the 
thermal resistance (Ry), a plethora of methods to experimentally extract this 
critical parameter in bipolar transistors have been developed. Among them, 
particular interest has been paid to approaches based on DC measurements, 
for which low effort and relatively cheap instrumentation are required. 
The most widespread method — presented in slightly different variants in 
the literature [Daw92, Pfo03, Rie05] — relies on the measurement (1) of the 
temperature-sensitive base-emitter voltage to employ its temperature coeffi- 
cient as a thermometer, and (ii) of the base-emitter voltage as a function of 
collector—base voltage (or dissipated power) at an assigned emitter (collector) 
current. A sticking point of this technique is the thermometer calibration, 
which can be impacted by SH and thus entail a thermal resistance overesti- 
mation. A strategy to purify this procedure from SH has been proposed by 
Vanhoucke et al. [Van04]. Here an alternative to [Van04] is presented, which 
suggests a logarithmic law for the current dependence of the temperature 
coefficient of the internal (junction) base-emitter voltage Vggj and can be 
explained as follows. 

In the absence of HI and II effects, the collector current Ic of a SiGe HBT 
(which exhibits marginal Early effect) can be described by the simple model 


Verj + > Ue) AT; 


Ic = Ag J so ex 
C EJ S0 p nVro 


(5.9) 


where Vpggj is given by: 
Vex = Vee — kelp — Rplp (5.10) 


Rp, Rg being the parasitic base and emitter resistances. In Equation (5.9), 
Ag = We x Le is the effective emitter area; Jso is the reverse saturation 
current density, n (= 1) is the ideality factor, and Vro is the thermal voltage, 
all at temperature To = 300 K;  [V/K] (+0) is the temperature coefficient of 


5.4 Thermal Effects 215 


Vpxj (in absolute value) and AT; is defined as Tj - To, Tj being the (average) 
temperature over the base-emitter junction. This implies that Equation (5.9) 
accounts for the temperature dependence of Ic (~Ig) making use of a Vggj 
shift, by keeping Jso, n, and Vro at their To values (e.g., [Zha96]). The o 
dependence on Jc (Jp) can be described with the following logarithmic law 
[Nen04, dA110, dAl14, dA116, dAl17]: 


k Ic k Ig 


Ic) = In N In 5.11 
o (Ic) = %0 i Tei bo i7 maT (5.11) 


Parameters Jso and y in Equation (5.11) can be optimized by invoking the 
following procedure. First, the Jc — Vgg characteristic of the DUT is mea- 
sured at various thermochuck temperatures Tg under CE conditions by 
keeping Vcg small and sweeping Vpgg up to values sufficiently low to 
reasonably neglect SH, HI, II, and resistive effects; as a consequence, the 
Ic — Vpr curves can be modeled by: 


Vee + > (Ie) - (TB — =) 
nVro 

which stems from Equation (5.9) by considering Tj = Tg and Vggj = Ver. 

Parameters Jso and n are then tailored to match the experimental curve at 

Tp = To with 


Ic = AgJso exp | (5.12) 


Vj 
Ic = ApJon exp ( 2E ) (5.13) 
n Vro 


and ọọ is safely (SH is negligible) calibrated so as to ensure good agreement 
between all the Jc — Vpr characteristics (at different Tgs) and the model 
given by Equation (5.12) with (5.11). Once po is known, Equation (5.11) can 
be used also at medium current levels, where the extraction of dg would be 
inaccurate due to SH. Further details concerning the derivation of Equation 
(5.11), as well as the physical meaning of parameter po, can be found in 
[dAl14]. Combining Equations (5.9) and (5.10), it can be obtained that: 


Ic 
Vee = Rplp + Rele — 6 (Ic) AT, + nVro ln ——— aie (5.14) 
By exploiting the thermal equivalent of Ohm’s law, AT; is expressed as: 
AT; = Tj — To = RryPp + Tp — To (5.15) 


where Pp is the dissipated power. If Tg = To, Equation (5.15) can be 
recast as: 


AT; = RruPp = Rerun: (Veele + Voplo) ~ Rra: (Vee + Vos) Ie 
(5.16) 
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wherein use has been made of the Pp expression in terms of applied or mea- 
surable voltages and currents under CB conditions. By substituting Equation 
(5.16) into (5.14), 


_ Rele — > Ue) Rra Voste + Vroln qs 
1+ (Us) Rra le 

If a CB measurement is performed at Tp = To under a Vcg range limited to 

low values so as to avoid II effects, at a constant Ig sufficiently low to prevent 


HI and non-linear thermal effects, yet high enough to lead to perceptible SH, 
the (negative) slope y of the Vgg — Vcp characteristic is given by: 


_ dVpr _ > (Ie) Rrule 
dVop 1+ 6 (Up) Rrule 


whence the thermal resistance (Rra) can be evaluated as [dAI10, dA114, 
dA116, dAl17, Kam17a]: 


(5.17) 


VBE 


(5.18) 


lal sp e 
(1—|7)¢Ue)fe Use) Te 


In [dA114], this improved approach was applied to about 100 single-emitter 
SiGe:C NPN BEC HBTs manufactured by IFX. The transistors are divided 
into three sets corresponding to different technology stages (and scaling 
strategies), which are hereinafter denoted as #1, #2, and #3. In particular, 
(i) set #2 is slightly scaled (both laterally and vertically) compared with #1; 
the collector current of HBTs belonging to #2 at peak f'y is about 30% higher 
than that of the #1 counterparts with approximately the same emitter area; 
(ii) set #3 devices underwent an aggressive lateral scaling with respect to 
#2 ones, while being vertically similar to them. The key figures of the sets 
are reported in Table 5.3. The thicknesses of the shallow and deep trenches 
are equal to 0.3 um and 4.5 um for all HBTs, respectively. For each set, 
transistors with several combinations of emitter width/length were available. 


RTH = (5.19) 


Table 5.3 Key figures of the analyzed IFX technology states 


#1 #2 #3 
BVczo [V] 6.5-6.8 5.2-5.9 5.1-5.5 
Peak fr @ Vos =0 V [GHz] 190 225 235 
Jc @ peak fr, Vcs =0 V [mA/um?] 6.5-7.0 9.0-9.5 9.5-10 
Peak fr @ Vop =0.5 V [GHz] 215 230 240 
Peak fmax @ VcsB =0V [GHz] 250 310 330 


Peak fmax @ Vos =0.5 V [GHz] 280 350 380 
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Figure 5.25 illustrates the experimentally extracted Rpys as a function 
of Lp at assigned widths Wg for the three sets. It is shown that RTH 
(i) significantly increases by reducing Lp and (ii) is well above 10° K/W and 
can grow beyond 104 K/W for small emitter areas. In particular, the smallest 
DUTs of sets #1 (Ag = 0.20.57 m7), #2 (Ag = 0.14<0.39 um°?), and #3 
(Ap = 0.11 0.63 um?) suffer from Rry = 14,300, 21,000, and 22,000 K/W, 
respectively. 

Numerical evidence of the accuracy of this technique was provided in 
[Kam17a], where it was applied to the simulation of an IFX SiGe:C DUT 
belonging to set #3 through an advanced tool solving the Boltzmann transport 
equations of electrons, holes, and longitudinal optical phonons, as well as the 
Energy Balance Equations for the other phonon modes (see Chapter 2). 


5.4.2 Thermal Simulation 


A viable strategy to assess the impact of technology on the thermal behavior 
of SiGe HBTs involves the adoption of 3-D finite-element method (FEM) 
thermal simulations, which are suited to handle structures with arbitrarily 
complex geometries [Rei01, Wal02]. 

An interesting contribution has been given in [Sah13], where non-linear 
steady-state, large signal, and sinusoidal thermal analyses of an STM SiGe:C 
HBT (with drawn emitter area equal to 0.27 um x 10 wm) were carried 
out with Sentaurus; the thermal resistance was found to be in fairly good 
agreement with the one measured according to the procedure in [Pfo03], 
although no thermal conductivity degradation mechanisms (e.g., due to high 
doping) were accounted for. The Back-End-Of-Line (BEOL) structure was 
found to play a marginal role due to the absence of the metal-via stack above 
the emitter. This analysis has been recently extended to cover the influence of 
BEOL on the thermal behavior of multi-finger devices, with emphasis on the 
coupling among fingers [Dwil16]. 

In [dA110], the software package Comsol [Com] was adopted to analyze 
SH in several STM SiGe:C HBTs. In spite of their geometrical complexity, 
the devices were reproduced with a very high accuracy up to the emitter, base, 
and collector contacts, the top surfaces of which were considered adiabatic, 
that is, the BEOL architecture was not included. Unfortunately, although the 
upward heat flow was unrealistically suppressed, the numerical Rpys were 
found to underestimate by about 20-25% the experimental values deter- 
mined through the technique described in the section “Experimental RTH 
Extraction,” independently of technology stage and emitter size. An improved 
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Figure 5.25 Thermal resistance (Rru) as a function of emitter length (Le) for various 
emitter widths (Wg), as experimentally determined for sets (a) #1, (b) #2, and (c) #3. 
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variant of the approach in [dA110] was applied to IFX transistors in [dA116]. 
The advances with respect to [dA110] are reported below: 


e The whole BEOL structure, comprising five metal (copper) layers and 
related interconnections (copper vias between metal layers, tungsten 
contacts between silicon and the lowest metal layer), was taken into 
account, as well as the external pads, as witnessed by Figure 5.26 
reporting the Comsol grid. This allows quantifying the cooling influence 
due to the upward heat flow (often disregarded in the literature), which 
is expected to be relevant since — differently from the STM transistor 
analyzed in [Sah13] — a metal-via stack is located over the emitter in the 
IFX DUTs. 

e In bipolar transistors, the power dissipation occurs at the base—collector 
SCR. In conventional approaches for thermal simulations, for a rectan- 
gular emitter window, such a region is modeled as either a rectangular 
or a parallelepiped heat source (e.g., [dA110, Sah13]), both with uniform 
power density. In [dAI16], the dissipation region is more accurately 
modeled by resorting to 2-D electrical simulations of the DUTs pre- 
liminarily performed with Sentaurus in order to determine a realistic 
power density distribution; for this aim, the hydrodynamic model with 
transport parameters optimized for SiGe:C HBTs [Sas10] was used. 
By referring to the schematic cross section of the DUTs represented 
in Figure 5.27, the heat sources exploited in the Comsol structures 
were built with the power density pattern obtained by reproducing the 
distribution computed by Sentaurus in the (x, z) plane and assuming a 


Figure 5.26 Detail of the 3-D Comsol mesh for the IFX transistor with 
Ap = 0.13 x 2.73 wm’, composed of 1.35 million tetrahedra of grossly different dimensions, 
corresponding to 1.8 million degrees of freedom. 


220 Reliability 


l= 2G TGe 


materials 

silicon 
~~ silicon dioxide 
= trench polysilicon 
{=m emitter polysilicon 
mmm base polysilicon 
mas CoSi 
mg SiGe 
m tungsten 


=m heat source 


ST=shallow trench 
DT=deep trench 


Figure 5.27 Schematic representation (limited to the innermost tungsten contacts) of the 
typical cross section of the IFX DUTs. 


uniform density along the device length (i.e., along the y-axis orthogonal 
to the cross section). 
Thermal simulations are usually performed by setting the thermal con- 
ductivities k [W/mK] of the materials to values measured from “bulk” 
samples (listed in Table 5.4). However, in practical cases, many effects 
concur to reduce k, which can be even position-dependent within the 
same material. In the SiGe alloy, k is a function of the z-dependent Ge 
mole fraction xGe according to the law [Pal04] 


(1 = TGe) TGe ia 


kgice = + 
SiGe ksi kGe 
Table 5.4 Bulk thermal conductivities 

Material Bulk Thermal Conductivity [W/mK] 
Silicon 148 
Germanium 60 
Silicon dioxide 1.4 
Tungsten 177 
Copper 390 
Emitter polysilicon 40 

Base polysilicon 30 
Trench polysilicon 20 
Cobalt silicide 9.6 
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where kg; and kge are the thermal conductivities of pure Si and Ge, respec- 
tively, and cy, is a bowing factor equal to 2.8 W/mK. Due to the k lowering 
imposed by Equation (5.20), the SiGe layer behaves as a barrier for the heat 
flow from the heat source to the emitter [Pet15]. The thermal conductivity 
is also adversely impacted by doping due to the enhanced phonon-impurity 
scattering, as experimentally observed in [Sla64, McC05, Lee12]; a compact 
formulation to account for this effect is [Lee12]: 

ksi doped (KsiGe,doped) = ksi (ksice) a (5.21) 

1+A: ( vx) 


where N [cm~ °] is the position-dependent total doping concentration (accep- 
tors and donors), Nnorm = 102° cm~?, while the values of the parameters 
are A = 0.74186, a = 0.7411 for boron [Lee12], and A = 1.698, a = 0.8251 
for arsenic, as obtained with a calibration procedure relying on experimental 
results provided in [McC05]. Lastly, the heat propagation through laterally 
thin layers can be significantly jeopardized by the phonon scattering with 
the layer boundaries [Liu05]. In SiGe HBTs, where the heat flow is mostly 
vertical, scattering mechanisms — expected to be exacerbated in narrow (low- 
We) transistors — can take place along device portions like (from the top) 
emitter tungsten contact, Si emitter, SiGe base, and Si volume surrounded 
by ST. This deleterious effect can be included by using, e.g., the simple 
analytical method proposed in [Tor00], which leads to a reduced anisotropic 
thermal conductivity with x-dependent components given by: 


0.75 
Byala) = 1—łexp ( ) 
ksi,doped (ksiGe,doped:Kcontact ) 2 Tcharyz 522 
1 1-2! oS s i ) 
2 exp GES ) 
kx(2") EE a \095 
ksi,doped (kSiGe,dopedKcontact ) 2 Tcharx (5 23) 


1 exp | a] 

where x’ = x/W, W being the layer width (along y), Xcharyz = 0.32 - A/W 

and Xcharx = 0.72 - A/W, A being the mean free path for phonons (equal to 

300 nm in Si and SiGe layers, and to 40 nm in the tungsten emitter contact). 

It must be remarked that only Equation (5.20) was accounted for in 
[dA110]. 
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Figure 5.28 Thermal resistances as a function of emitter width for IFX devices sharing 
Ly = 2.73 um: experimental (squares) values are compared with those calculated through 
the simulation approaches A (circles), B (triangles), C (flipped triangles), D (rhombi), and E 
(left-oriented triangles). 


As discussed in [dA116], Comsol steady-state simulations were per- 
formed by applying an adiabatic boundary condition at the top and lateral 
faces of the structure, and an isothermal condition on the backside (Tg = To). 
The thermal resistance was determined by evaluating the average of the 
temperature field over the base-emitter junction, which mostly influences 
the behavior and performance of the device [Zha96], subtracting To and 
normalizing to the dissipated power (Pp). Results corresponding to DUTs 
with different Wes and sharing Lg = 2.73 um are reported in Figure 5.28, 
which shows: 


e the Rrys determined through the improved experimental technique 
outlined in the section “Experimental Ry Extraction”; 

e the Rrys simulated with Comsol by considering the full advanced 
approach described above (denoted as approach A), i.e., by including the 
BEOL architecture and accounting for the non-uniform power density 
pattern and the conductivity degradation mechanisms; 

e the Rrys calculated with Comsol by modeling the power dissipating 
region through a standard parallelepiped-shaped source with uniform 
power density, while considering all other effects and the BEOL struc- 
ture (approach B); 

e the Rpys computed with Comsol by accounting for a heat source 
with non-uniform power density and replacing the metal in the BEOL 
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architecture with SiO» so as to virtually exclude it, while including the 
first-level tungsten contacts only (approach C); 

e the Rpys evaluated with Comsol by restoring the BEOL, and consid- 
ering uncorrected “bulk” values for the thermal conductivities and a 
standard parallelepiped-based heat source (approach D); 

e the Rrus computed with Comsol by disregarding the above effects and 
excluding the BEOL so as to emulate a traditional simulation technique 
(approach E). 


By using approach A, the Rpy of the device with Wp = 0.13 um was 
calculated to be 6,437 K/W, which is in fairly good agreement (—5.3%) with 
the experimental value (6,800 K/W); conversely, a relatively high underes- 
timation (-15%) was obtained for the widest (Wg = 0.55 um) device, the 
numerical and measured Rys being 4,333 K/W and 5,100 K/W, respectively. 
A post-processing analysis revealed a markedly non-uniform temperature 
distribution along x over the base-emitter junction compared to low- Wg tran- 
sistors, which can be ascribed to the concurrent action of the low kgice and the 
narrow tungsten emitter contact (the width of which does not scale with Wg). 
As a consequence, the evaluation of Ry with a standard geometrical AT; 
average over the whole junction is likely to be incorrect, and the accuracy 
should be improved by developing more complex averaging approaches that 
would lead to a higher FEM Rry. If approach B (with the traditional heat 
source representation) is adopted, the numerical Rry lowers (compared to 
A) from —9% for the HBT with Wg = 0.13 um to —5.9% for the one with 
We = 0.55 um, where the base-emitter temperature is non-uniform. Hence, 
it can be stated that the heat source representation plays a significant role. 
By making use of the BEOL-free approach C (upward heat flow almost 
annihilated), the FEM Ryy of the transistor with Wg = 0.13 um grows to 
8,712 K/W, which corresponds to +28% with respect to the experimental 
value; this means that, although the low-conductivity SiGe base and Si 
emitter concur to limit the upward heat flow, the BEOL effectively extracts 
heat from the emitter. This mechanism is also amplified by the doping- 
affected conductivity of sub-collector, which counteracts the downward heat 
propagation. Similar considerations hold for the other narrow HBTs, whereas 
for the device with Wg = 0.55 um the lower overestimation (+14%) can 
be again attributed to the too simple geometrical averaging procedure for 
the junction temperature field. By exploiting approach D, the DUTs enjoy 
an exacerbated cooling effect dictated by the BEOL architecture and the 
adoption of the “bulk” thermal conductivity of Si, which favor both the 
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downward and upward heat flow. Consequently, the FEM Rrys are far lower 
(about —45%) than the experimental counterparts. As expected, employing the 
traditional approach E leads to an underestimation of about —20% regardless 
of Wr, since the deactivation of the k reduction mechanisms (which would 
imply a heating effect) prevails over the BEOL absence (which would instead 
cool down the device). 


5.4.3 Scaling Considerations 


Thermal effects in SiGe HBTs still need to be included in the circuit design 
process via suitable compact models, which require a geometry-scalable 
lumped description of the thermal resistance, i.e., an expression of Rpy as 
a function of Wp and Lg for a given technology stage. 

The following simple law was proposed for HICUM/L2 [Sch13]: 


2 Rruo 
l+awWe + aL Lg 


Rru (5.24) 


where Ryo [K/W], aw [um +], aL [umt] are fitting parameters. Another 
formulation, conceived for Mextram504, relies on the preliminary knowledge 
(from experiments) of the thermal resistance (RTHref) of a reference transis- 
tor, and three dimensionless fitting parameters (b4, bw, by) to be calibrated 
[Wu06a, Wu06b]: 


RTHref 


Do — 1) + bw (wey —1) +81 (a — 1) 

(5.25) 
Lastly, a more sophisticated model was developed by resorting to the follow- 
ing procedure. The exact closed-form solution to the heat transfer equation for 
a rectangle-shaped indefinitely-thin heat source (THS) with area Wg x Lp 
located on the adiabatic top surface of a semi-infinite homogeneous “bulk” 
domain (with thermal conductivity k) is given by [Rin00] 


ra (ee Le + y Wg + Li 
T e | 

mM PE \ In t+ Wet LE 

1 We + y WÈ + Li 


In 


We \ We + ./W2+ IL? 


Rr = 


(5.26) 
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It is worth noting that the width and length of the THS were assumed to coin- 
cide with the emitter ones, which is a reasonable assumption. Unfortunately, 
Equation (5.26) with k = 148 W/mK (thermal conductivity of Si, as can be 
seen in Table 5.4) revealed to be unsuited for SiGe HBTs: the Rpy values 
were found to be about 65-75% lower than the experimental counterparts 
(addressed later) although the cooling effect due to the upward heat flowing to 
the BEOL structure is not modeled. This means that the heating effect caused 
by the shallow/deep trenches filled with low thermal conductivity materials — 
not included in Equation (5.26) as well — plays a role more important than 
BEOL. Equation (5.26) can be recast in the form: 


2 
Wy 
aE 1+ (H2) +1 
= ln 


TH Drk = 7 l 
14 (4) H1 
1 Ly Ly 
In 


If Wg/Lg <1, the square root can be approximated with a first-order Taylor 


series expansion 
Wp \? 1 (We\? 
t1le14 2 
( Ly 2 ( Ly vee 


By substituting Equation (5.28) into (5.27) and neglecting the second-order 
terms, it is found that: 


1 f2 Inp\ 1 _ We We 
Re en (272) + ge eh + FE) -h-E 
(5.29) 


Finally, by expressing also the logarithms with a first-order Taylor series 
expansion, after some algebra, 


1 Lg 
N In { 2 1 3 
Rru ae [i ( =) + | (5.30) 


Equation (5.30) represents a good approximation of the exact (5.26) for heat 
sources with medium/high aspect ratio Wg/Lg, while slightly losing accuracy 
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when Wr->Lg. It was empirically demonstrated that Equation (5.30) can be 
extended to a wider range of Wg values by introducing a correction term 
equal to 0.12 in the logarithm argument, which leads to: 


1 Lg 
= =——_|}] .12 + 2— 1 31 
Rru aa [in (0 + a) + | (5.31) 


In order to potentially predict the Lg and Wg dependence of the Rry for SiGe 
HBTs of a specific technology stage, Equation (5.31) was further generalized 
to [dA114]: 


1 Lpg cw 
Rays los L14 5.32 
E i [m (c R =Æ) | re) 


where c, cp, cw and the thermal conductivity (k) are fitting parameters. In 
particular, the term cw/Wp was introduced to ensure a good fitting over a 
broad Wg span. It must be remarked that considering k as a fitting parameter 
has physically sense in SiGe HBTs, since the heat emerging from the dis- 
sipation region propagates through various materials with different thermal 
conductivities (e.g., Si, SiGe, poly, oxide, and tungsten). Models (5.24), 
(5.25), and (5.32) with optimized parameters (see Table 5.5) were compared 
to experimental data determined with the approach described in the section 
“Experimental Rpy Extraction” on set #2 devices for various Wgs and three 
emitter lengths in Figure 5.29. 

Results can be summarized as follows: law (5.24) relying on three fitting 
parameters (the calibrated Ryo is well above the range of the experimental 
Rrys, and thus cannot be interpreted as a real thermal resistance) and (5.25) 
based on three fitting parameters plus a “reference” (measured) thermal resis- 
tance are suited to offer a fairly good matching with experimental data within 
a wide range of Lg and Wy values. Excellent agreement is provided by (5.32), 
which is an extended version of a formulation derived for homogeneous 
“bulk” domains, and makes use of four fitting parameters, one of which is 
thermal conductivity. Interestingly, it was found that the optimized k value 


Table 5.5 Optimized parameters of the scalable Rra models 


Model Parameters 
(5.24) Rryo = 34,893 K/W aw = 4.46 m7! ay = 1.44 ym? 
(5.25) Retret = 3,570 K/W ba =0.034 bw =0.045 bu = 0.792 


(We x Le =0.14 x 5.69 um?) 
(5.32) c=0.97 cr = 1.265 cw =0.0166 k=80 W/mK 


References 227 


10000 p i 


A © m experimental 


0.5 1 3 
Effective emitter width W- [um] 


Figure 5.29 Comparison between scalable models (5.24) (dashed lines), (5.25) (dotted), 
(5.32) (solid) with calibrated parameters, and experimental Rrus (symbols) for set #2 
transistors. 


(80 W/mK) is lower than the Si counterpart, which is physically reason- 
able since the lateral heat propagation is mostly influenced by shallow/deep 
trenches filled with the low-conductivity materials like poly and oxide. 
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6.1 Millimeter-wave Benchmark Circuits and Building 
Blocks 


A. Mukherjee, W. Liang and M. Schröter 


The continuous progress of SiGe:C HBT BiCMOS process technology paves 
the way for high-volume low-cost mm-wave and sub-mm-wave applications. 
The design of the corresponding high-frequency (HF) integrated circuits 
requires accurate compact models for both active and passive devices. Espe- 
cially, the compact models for active devices must cover many physical 
effects occurring in advanced process technologies and address a wide bias, 
temperature, and geometry range as well as high-frequency (HF) effects such 
as non-quasi-static delay and substrate coupling. Devices used in HF circuits 
typically operate at 3 to 10 times the circuit speed due to the harmonics 
generated within the circuits that ultimately determine the signal shape. The 
verification of compact models at such a high speed has become a major 
issue since device measurement capability has not kept pace with process 
and circuit development. While there has been some effort toward extending 
small-signal (S-parameter) measurement capability toward several 100 GHz, 
direct experimental verification of compact models for large-signal operation 
at mm- and sub-mm-wave frequencies still appears illusive. For instance, 
load-pull measurements beyond 50 GHz are not only difficult and expensive 
but also do not provide any phase shift information, which is important for 
describing time-dependent large-signal switching correctly. 
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The demand for model accuracy to ensure one-pass success for saving 
R&D cost in mm-wave and sub-mm-wave circuit design forces compact 
models of transistors to undergo tests in a vast range of operating conditions 
instead of merely verifying typical device characteristics. Therefore, model 
verification has been extended to small circuits in which transistor operation 
can be tested under realistic application-relevant conditions. These circuits 
comprise benchmark blocks and small building blocks of larger systems. 

Benchmark circuits on the one hand have to be sufficiently simple so as 
to avoid masking compact transistor model deficiencies by other effects, but 
should on the other hand resemble the typical transistor operation in related 
larger circuit building blocks. A well-selected set of benchmark circuits 
should allow the transistors (and their associated models) to be exercised 
in application-relevant operating modes beyond the typical standard device 
characteristics measured in a characterization lab. In addition, the same 
benchmark circuits can also be employed for evaluating process performance 
and for detecting processing issues in terms of the targeted applications 
during the process development phase. 

The circuit building blocks are concerned with practical needs toward, 
e.g., lowering power consumption or utilizing the transistor non-linearity for 
harmonic power generation in mm-wave circuits. Here, transistor operation 
in extreme regions is of interest, e.g., at low collector—-emitter voltages 
(i.e., at significantly forward-biased base—collector junction) or beyond the 
open-base breakdown voltage. The related building blocks target competi- 
tive figures of merit (FoMs) and serve also for demonstrating the process 
technology’s capability. 

Using these relatively small circuits for the above-mentioned purposes 
has so far been hampered by various factors. On one side, modeling and 
process engineers lack the necessary circuit design expertise and on the other 
side circuit designers have little interest in designing, from their perspective, 
relatively simple circuits. In DOTSEVEN, for the first time, an attempt was 
started to better bridge these two worlds by fabricating a set of circuit blocks 
partially designed by the modeling community of the project. The experi- 
mental results of the various circuits were then compared with simulations in 
order to establish a solid understanding of the accuracy of the compact models 
under circuit-relevant constraints. Several examples are presented below. 


6.1.1 Benchmark Circuits 


A. Broadband amplifier using a Darlington pair 
The broadband amplifier (BBA) is an integral part of both wireless and 
wireline communication systems. Figure 6.1 depicts two variants of the BBA 


6.1 Millimeter-wave Benchmark Circuits and Building Blocks 237 


Figure 6.1 Schematic of the broadband Darlington amplifier (a) without and (b) with 
peaking inductor (Lp). 


schematic, with their input and output matched to the 50 Q system impedance. 
This type of amplifier generally shows a low-pass behavior, i.e., it provides 
its maximum gain at low frequencies. Here, the BBA topology is derived 
from the basic Darlington configuration [Mukh16, Gray08, Vera13, Veral14], 
but uses a modified Darlington pair consisting of an emitter follower and a 
common emitter transistor. 

The degeneration resistance (Rp) actually increases the terminal 
impedance of the stage and can at the same time be used to bias the transistor 
Q1. The advantage of the Darlington configuration over a simple single 
degenerated stage is that an appropriate choice of Rg yields a current gain 
bandwidth which can approach twice that of a single stage [Armi89]. 

The operating points of the transistors are adjusted, as shown in the 
schematic, through external bias-Tees and the resistor (Rp). The feedback 
resistance (Rp) allows adjusting the gain flatness. One of the main aspects 
of benchmark circuits is the need to be able to quickly design them, prefer- 
ably by modeling or process engineers. This requires the development of a 
generic design procedure [Mukh16], which may not achieve world-record 
performance but guarantees a working circuit that meets the circuit purposes. 
Such a procedure is given below: 


1. The design starts with choosing Vcc at or near BVcro as specified by 
the corresponding process design kit (PDK) documentation. 

2. Both the transistors are biased at Jc(fT peak) according to the fr — 
Jo plot (cf. Figure 6.2). The corresponding Vpgg values determine 
VBB = VBE1 + VBr2- 

3. As no explicit input and out matching network is used in the circuit, the 
emitter length of Q1 is adjusted so as to make the real part of the input 
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impedance 50 Q. For both transistors the minimum emitter width should 
be used. 


. Since Q1 is operated as emitter follower, the current through Rp is 


much larger than the base current into Q2 so that Rp, ~ Vpro/Ic1. 


. The initial emitter length of Q2 can then be chosen similar to Q1, but 


needs to be adjusted according to its larger Vcg to maintain operation at 
JoGT peak): 


. The initial value of Rp can be obtained from, Rp/[1 — S21(f = 0] = 502. 
. To enhance the bandwidth of the amplifier, a peaking inductor (Lp) 


can be added in series to the input of Q1. The value of Lp can be 
calculated from the resonance condition at the input of Q1, know- 
ing its input impedance, and at the 3 dB frequency of the gain of 
the BBA without peaking inductance. The value can be calculated by 
Ly = Imag(Zjn)/(27f3aB), where f3ap is the original 3 dB frequency of 
the amplifier. 


. Further optimization of the circuit is required after EM simulation of the 


entire circuit. 


Two variants, with and without the peaking inductor, of the BBA were fabri- 
cated in IHP’s first DOTSEVEN technology run. The backend of that process 
offered seven metal layers with five thin metals and two thick top metal layers 
[IHP03]. Important transistor characteristics along with the comparison to the 
compact model data are shown in Figure 6.2. 


The “topmetal2” of the process has been used to realize the inductor and 


the other required transmission line interconnects. Both amplifier versions 
were configured for on-wafer measurements using GSG probes along with 
DC biasing through bias-Tees. The die micrographs of the two amplifiers 
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Figure 6.2 Comparison between measured data (symbols) and compact model HICUM/L2 
results (lines). (a) Transit frequency fr vs. Jc for different Vgc values. (b) Transconductance 
8m VS. frequency for Vgc = OV and at different Jc = (1, 5, 10, 20) mA/um?. 
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are shown in Figure 6.3. The total die area of the individual amplifiers is 
just 0.05 mm? and fits into regular HF GSG pads also used for transistor 
characterization. The two amplifiers were biased with a single supply voltage 
of 1.8 V. 

The S-parameters were measured using a Keysight PNA-L5235A with 
110 GHz extenders. The measurement includes effects of pad parasitics, on- 
chip transmission lines connecting input and output and other components of 
the amplifier layout, i.e., no de-embedding of those elements was performed. 
The small signal gain S21 in the frequency range of 0.5—-110 GHz along with 
a comparison with post-layout simulation is shown in the Figure 6.4. 

The measurement shows a bandwidth of 78 GHz for the BBA without 
peaking inductor and of 109 GHz for the amplifier with peaking inductor. 
The measured stability factors for both amplifier versions are shown in 


(a) (b) 
Figure 6.3 Die photograph of the BBA (a) with and (b) without peaking inductor. The chip 
size in both cases is (0.245 x 0.18) mm?. 
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Figure 6.4 (a) Small-signal gain and (b) stability factor of the BBA with and without peaking 
inductor: comparison between simulation (lines) and measurement (symbols). 
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Figure 6.4(b) and ensure unconditional stability. The simulations agree quite 
well with the measurements. 

Transistor models for advanced SiGe HBTs have to cover many phys- 
ical effects in order to achieve the desired accuracy for enabling first-pass 
design. The resulting model complexity makes it difficult to understand the 
impact of each physical effect or model parameter on circuit performance 
under realistic operating conditions. Knowing this dependence is important 
for model developers, circuit designers, and process engineers. Therefore, a 
sensitivity analysis was performed by analyzing the changes of the relevant 
circuit FoMs with respect to variations in model parameters. For the BBA 
here, the gain (S21), the input and output reflection coefficients S11 and 
S22, the stability factor k, and the bandwidth were selected as the important 
FoMs. The model parameters of the two transistors (Q1 and Q2) were varied 
separately to identify the model parameters that are most influential on the 
above-mentioned FoMs. The Figure 6.5(a) displays the maximum relative 
sensitivity of the above mentioned FoMs for those model parameters of Q2 
that cause changes of more than 5% in at least one of the FoMs as a response 
to a +20% change in the respective model parameter. Figure 6.5(b) shows for 
Q1 the three model parameters that have the highest impact in at least one of 
the FoMs. 

It can be observed in Figure 6.5 that the emitter resistance (re), the low- 
current transit time (t0), and the thermal resistance (rth) of Q2 have the 
biggest impact. re causes a large change in input return loss as a small change 
in its value directly affects the corresponding BE-voltage of the transistor. 
The emitter resistance (re) of Q1 also shows considerable impact on input 
return loss but its contribution is masked by the large value of Rp in series. 
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Figure 6.5 Sensitivity of the series peaked BBA performance parameters with respect to 
HICUM model parameters for the transistors (a) Q1 and (b) Q2. 
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The influence of t0 is caused by the associated diffusion capacitance ($11), 
which dominates the input capacitance, and the base—collector voltage- 
dependent mobile charge in the transfer current, impacting the output con- 
ductance and thus S22. Interestingly, the bandwidth is mostly impacted by 
rth of the second transistor Q2. The sensitivity analysis of the BBA without 
series peaking inductance shows a very similar trend and thus is not shown 
here. 


B. W-band low-noise amplifier (LNA) 

In this section, the design and implementation of a wide-band LNA for the 
frequency range of 90-110 GHz is described. The architecture employed here 
includes a shunt—shunt feedback resistor along with an input matching LC z- 
network and a post-cascode series peaking inductor [Lin07]. The basic idea of 
the z-network is to add a low-pass filter at the input side of the LNA [Lin07]. 
It enables the input impedance of the LNA to be matched with the source 
impedance (50 Q) when the input frequency is less than the cutoff frequency 


of the low-pass filter, 
1 


< ~ á 
fs 2ry LBC 


Considering the equivalent circuit (EC) as shown in Figure 6.6(b), the input 
impedance (Zin) can be written as, 


(6.1) 


Z = s’RpC,zLp + slp + Ry (6.2) 
E sS RfCinCrLB + s2 LBCin + sRf (Cin + Cr) + 1 ` 


Setting Zin = 50 Q and with s = jw, the above equation can be solved for two 
frequencies with perfect input impedance matching, 


SnV bei 


Figure 6.6 (a) LNA with 7-network for input matching, (b) small signal equivalent circuit 
of the LNA with feedback resistance Rfg and input matching elements. 
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wi = 0 (6.3) 


2 1 


“2 IsO Z 


(6.4) 
For the design of the LNA the cascode configuration is adopted due to its 
better reverse isolation and higher small-signal gain. The optimum DC bias 
voltage for the common-emitter transistor was found from the NF yin VS. 
Jc measurement (cf. Figure 6.7) of a separately measured single-emitter 
transistor. 

Figure 6.8 shows the schematic of the implemented wide-band LNA. 
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Figure 6.7 NF and NF yin versus Jc for a SiGe HBT with Ago = 0.7 x 0.9 um? for 
Vor = 1.2 V at f = 90 GHz. 


Figure 6.8 Schematic of the single-stage wide-band (90-110 GHz) LNA. 
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At mm-wave frequencies, the layout of the circuit plays a pivotal role 
for the circuit performance. Initially both transistors were contacted up to 
top metal. The insertion of the series inductor Lp and parallel capacitor 
Cin compensates for the effect of the input capacitance Cr. This strategy 
produces a third-order ladder type low-pass filter network which can reduce 
the imaginary part of Yı and hence increase the input-matching bandwidth 
[Lin07]. The initial values of Lp and Cin were calculated with the help of 
Equation (6.4) and can be further adjusted for optimized performance. Cin 
was implemented using the MIM capacitor between metal5 and topmetal1. At 
the desired high frequencies, Lp was realized using the available topmetal2 
of the process. A small degeneration inductor Lg (~20 pH), implemented 
as a metal line, was added to ensure good linearity and better stability 
[Ko96, Afsh06]. After several simulation iterations, the value of Rp was fixed 
to 415 Q. 

The output matching network consists of Lc and Co, the values of which 
were carefully chosen to provide S22 matching over the frequency range of 
interest. A small peaking inductor Lp was added to achieve a better small- 
signal gain (S21) flatness. All interconnects between the passive elements 
were realized with topmetal2 and were EM-simulated to include the effects of 
the layout parasitics. The base biases of the two transistors were fed through 
3 KQ resistors that use unsalicided, p-doped gate polysilicon as resistor 
material [IHP03]. 

Depending on the biasing of this circuit different results are obtained. 
One goal here was to verify the compact model for low-power applications 
operating in saturation. Therefore, the DC bias values at the base terminals 
were chosen as Vp; = 0.94 V and Vpə2 = 1.6 V, respectively. Together with a 
supply voltage (Vcc) of 1.4 V this ensured that both transistors work in the 
“saturation region”, i.e., with positive external Vgc of about 0.23 V. 

The on-wafer S-parameter measurements were performed using a 
Keysight PNA-L5235A with 110 GHz extenders. The noise of the amplifier 
was measured at the IMS lab in Bordeaux, France. Figure 6.9 shows the 
S-parameters of the fabricated amplifier along with circuit simulation. The 
moderate performance of S92 up to 80 GHz affects the output reflection 
coefficient of the measurement equipment. Generally, the agreement between 
measurement and simulation is quite satisfactory though. 

The noise measurement was performed from 75 GHz to 90 GHz, which 
was the highest frequency range for which a noise source and respective 
measurement equipment were available. The measured NF at 90 GHz is 5 dB, 
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(a) (b) 
Figure 6.9 (a) Small-signal results of the wide-band LNA: comparison between mea- 
surement (dashed lines) and simulation with HICUM/L2 (solid lines). (b) Corresponding 
frequency-dependent noise figure NF: comparison between measurements (symbols) and 
simulation of (blue line) and NF min (red dashed line). 


which is slightly less than simulated. The simulation was performed with the 
noise-correlation model turned on. 

Figure 6.10 shows the results of a sensitivity analysis for the designed 
LNA where the model parameters of the two transistors Q1 and Q2 were 
varied separately. In case of transistor Q1, the impacts of only those model 
parameters are shown that cause the maximum relative sensitivity of the 
relevant FoMs to vary at least 4% in response to a parameter variation in the 
range of +20%. In case of Q2 the three most influential model parameters are 
shown. 
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(a) (b) 
Figure 6.10 Sensitivity of the wideband LNA performance parameters with respect to 
HICUM model parameters for the transistors (a) Q1 and (b) Q2. 
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It can be observed from the above figure that the low-current transit time 
(t0) and the associated delay time of the transfer current (alit*t0) of Q1 and 
Q2 have the biggest impact on mainly the noise figure (NF). The delay time 
enters the sensitivity through the noise correlation. The gain of the cascode 
amplifier is basically controlled by the CE transistor (Q1) rather than the 
transistor in CB mode (Q2). The emitter resistance (re) of Q1 mostly impacts 
the input reflection and gain. Generally, the FoMs are less sensitive though to 
the parameters of Q2 compared to those of Q1. 

It must be mentioned here that in this sensitivity study the model 
parameters are varied individually, i.e., their correlation through process 
and structural parameters of the transistor were ignored [Schr05]. A study 
including the correlations can be done using a special transistor scaling tool 
[Schr99]. 

The performance of LNAs can be compared through the following FoM, 


Bandwidth (GHz) x Gain (dB) 


Pitas 
RENA (Fang — 1) x PpoolmW) ’ 


where, Favg is the average noise factor within the band and Ppg is the 
DC power dissipation of the circuit. In this FoM, the gain in decibels used 
as the power consumption is proportional to gain in decibels [Sato10]. 
Table 6.1 compares the performance of this LNA with other state-of-the-art 
broadband mm-wave LNAs reported recently. Despite operation in saturation, 
the performance compares reasonably to the other designs and its FoM is only 
exceeded by an 80 nm HEMT amplifier. 


6.1.2 Circuit Building Blocks 


So far, some results going along this direction have been reported. In [Seth11, 
Inan14], LNAs for the 8-12 GHz and 10-22 GHz bands were designed 
with reduced supply voltages. At a higher frequency, a 65 GHz LNA was 
implemented in a 130 nm SiGe HBT process in [Agar14]. A 53.5 GHz SiGe 
HBT oscillator with only 0.5 V supply voltage was reported in [Sah14]. In 
the following context, the design of a W-band low-power LNA is presented, 
which uses an ultra-low supply voltage (Vcc = 0.5 V). This work aims 
to give an example showing how far the DC power consumption can be 
reduced while maintaining meaningful circuit performance for a mm-wave 
LNA with HBT transistors biased in the saturation region. The impact of 
varying transistor series resistances on voltage gain, minimum noise figure 
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(NF min), and third-order input intercept point (IIP3) is also investigated for 
this low-power LNA. 

Furthermore, the design of a W-band frequency tripler with 0.5 V supply 
voltage is presented, aiming at an output signal at 96 GHz from a 32 GHz 
input signal with as low as possible DC power consumption. This frequency 
tripler could be used as a candidate for generating W-band signals, together 
with a fundamental-tone oscillator located at a much lower frequency, to 
alleviate the problem of directly designing a W-band oscillator with satisfac- 
tory performance. Comparison between simulated and experimental results is 
given to verify the accuracy of HICUM model parameters at W-band when 
transistors are used to design non-linear mm-wave circuits with a reduced 
supply voltage. 


6.1.2.1 W-band low-noise amplifier (LNA) with 0.5 V supply 
voltage 

IHP SG13G2 SiGe HBT technology was used for designing the LNA in 
this section. Figure 6.11(a) shows that a collector current density of more 
than 10 mA/um? is needed to bias the HBT from this technology at its 
peak transit frequency (fp). The corresponding base-emitter DC bias voltage 
(VBeg) can be obtained from Figure 6.11(b), which shows that at least 0.85 V 
is needed to achieve such a collector current density. Therefore, if the design 
of an LNA with ultra-low supply voltage is targeted (like Voc = 0.5 V), 
then the transistor has to be biased in the saturation region (Vgc > 0.35 V). 
Figure 6.11(a) also implies that the HBT in this technology can still provide 
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Figure 6.11 (a) Transit frequency of an HBT from IHP SG13G2 versus its collector current 
density with Vgc = -0.5, 0, and 0.5 V. (b) Corresponding transfer characteristics. 
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Figure 6.12 Schematic of the three-stage common-emitter LNA. 


an acceptable value for peak fp when Vgc equals 0.5 V (around 260 GHz, 
compared with the value of around 320 GHz with -0.5 V Vgc in the normal 
forward-active case). In other words, with a supply voltage as low as 0.5 V, 
this transistor still retains a decent speed for designing mm-wave circuits. 

The topology of the LNA, shown in Figure 6.12, consists of three stages 
of common-emitter configuration with emitter—collector transformer feed- 
back to improve reverse isolation and stability at high frequencies. Besides 
stability, the emitter series inductor also serves as part of the impedance 
matching network. The amplifier is biased with Vi, = 0.89 V and Vec = 0.5 V, 
while the total power consumption of the three stages is only 2.79 mW (1.86 
mA for each stage). The topmost metal layer provided by the technology 
(TopMetal 2) is used to fabricate the transmission lines, whereas the lower 
metal layers (TopMetal 1, and Metal 5/4/3/2) are used for transitions going 
through different low-level layers. The bottom metal layer (Metal 1) is used 
as the ground plane all over the layout of the circuit. The LNA is designed 
with the aid of constant available power gain circles and constant NF circles 
of each stage. 

Figure 6.13 illustrates the constant available power gain circles and 
constant NF circles of the transistor used in the first stage of the amplifier 
(without transformer feedback) at 94 GHz. The source impedance posed to 
the base of the transistor (transformed from a 50-2 signal source by the input 
matching network) is selected as close to the center of the constant avail- 
able power gain circles as possible for higher power gain, while the source 
impedance is also chosen as close to the center of the constant NF circles as 
possible for lower noise mismatch (leading to lower noise contribution by the 
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Figure 6.13 Constant available power gain circles (blue curves, from 6.15 dB to 5.35 dB 
with a 0.2 dB step) and constant NF circles (red curves, from 3.48 dB to 4.28 dB with a 0.2 dB 
step) for the first stage of the amplifier at 94 GHz. 


corresponding amplifier stage). Therefore, in practice a compromise has to be 
made between power matching and noise matching in an LNA design. 

The measured and simulated S-parameter results of the three-stage 
amplifier are shown in Figure 6.14(a). With only 0.5 V collector supply 
voltage, this LNA can still provide 14.38 dB peak power gain at 91 GHz 
and more than 10 dB power gain over a frequency range from 86 GHz 
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Figure 6.14 Measured (symbols) and simulated (lines) results of the W-band ultra-low 
power three-stage LNA: (a) S-parameters and (b) noise figure. 
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to 100 GHz. The measured and simulated NFs of the LNA are shown in 
Figure 6.14(b), where the correlated noise has been turned on (flcono = 1) 
and off (flcono = 0), respectively. The measured NF at 90 GHz is 5.44 dB. 
The measured input-referred 1 dB compression point is —21 dBm at 91 GHz. 
Fairly good agreement between measurement and simulation (especially 
for the S-parameter results) has been achieved, which verifies the accuracy 
of the compact model (HICUM/L2) with a forward-biased BC junction at 
high frequencies (W-band). Figure 6.14(b) also shows that including noise 
correlation is not negligible, when there is a demand to accurately capture 
the noise performance of amplifiers at W-band. Note that noise correlation 
increases with frequency. 

To investigate the impact of transistor series resistances on the circuit 
performance of this LNA (S21, NF min, and IJIP3), a sensitivity analysis 
was performed at 92 GHz for the emitter, base, and collector series resis- 
tances (+30% variation) as shown in Figure 6.15. The corresponding model 
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Figure 6.15 (a) Absolute sensitivity of LNA FoMs w.r.t. to series resistance variation. 
Detailed variation of (b) S21, (c) minimum noise figure, (d) input referred third-order intercept 
point, all w.r.t. to series resistance variation. 
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Table 6.2 Performance summary of the W-band LNAs 


Freq 20*logio|S21| NF ITP3 Poc 
Reference Technology (GHz) (dB) (dB) (dBm)* (mW) 
[Cetil2] 45 nm CMOS 95 10.7 6 14.6 52 
[Vigil6] 28 nm CMOS 90 28 7 -2.7 31.3 
[Sev10] 130 nm SiGe 95 9 8.6 -5.3 13 
[May10] 120 nm SiGe 95 23 8 N/A 28 
[Yang13] 90 nm SiGe 90 19 5.1 -10.4 43 
[Ina14] 90 nm SiGe 94 10 4.2 -1.9 8.8 
This work 130 nm SiGe 90 14.3 5.44 -9.1 2.79 


*The listed IIP3 results are estimated from the reported input-referred 1 dB compression 
points. 


parameters are the zero-bias internal base resistance rpio, the external base 
resistance rp, the emitter resistance rp, and the external collector resistance 
rcx. Regarding the sensitivity of the input-referred IIP3, one would expect rp 
to have the largest impact on the linearity of an amplifier due to the series 
negative feedback introduced by this resistance. A reason for the less-than- 
expected change of IIP3 may be that the impact of rg (8.16 Q) is masked 
by that of the inductor in series at the emitter (cf. schematic in Figure 6.12), 
which has an impedance (w*L) of 8 Q at 92 GHz. The detailed variations 
of S21, NFmin, and IIP3 with regard to the variations of series resistances 
are shown in Figures 6.15(b—d). The fact that rp, has the biggest impact on 
the FoMs confirms that, at least for the considered process technology, the 
maximum oscillation frequency is the more relevant standard device FoM. 

The performance of this LNA, along with the comparison with other 
reported LNAs operating around 90 GHz, is summarized in Table 6.2. The 
results of this work clearly imply the option of operating transistors with very 
low collector supply voltage (0.5 V) while maintaining a reasonable power 
gain and NF performance at W-band. 


6.1.2.2 W-band low-power frequency tripler 

In this section, the design of a W-band low power frequency tripler is 
introduced. This frequency tripler is also designed in IHP SG13G2 SiGe 
HBT technology. The schematic of the core part of this frequency tripler is 
shown in Figure 6.16. The transistors used in the core harmonic generation 
cells have an emitter size of 0.07 um x 0.9 um x 3. The total DC power 
consumption (including the buffer amplifier) is 4.66 mW with a 0.5 V supply 
voltage [Lia17]. 
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Figure 6.16 Schematic of the W-band low-power frequency tripler. 


The topology of the tripler consists of two parts: the harmonic generation 
part and the output buffer amplifier part. Differential configuration is used 
in the harmonic generation part to suppress the even-order harmonic signal, 
with the use of on-chip baluns for single-ended-to-differential conversion. 
Extensive electromagnetic (EM) simulation was performed during this design 
as shown in Figure 6.17. The small-signal input/output return loss results 
are measured from one break-up harmonic generation cell as shown in 
Figure 6.18(a), which implies that the strongest output signal occurs at around 


Z 


Output balun 


/ 


Input balun 


Harmonic generation core 


Figure 6.17 Layout and corresponding EM simulation views of the frequency tripler. 
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Figure 6.18 (a) Input and output return losses of the core part of the frequency tripler; 
(b) Conversion gain (actually loss) of the frequency tripler. 


96 GHz with an input at around 32 GHz, which is as expected for the 
correct function of a W-band frequency tripler. The simulated and measured 
conversion loss results of the frequency tripler are shown in Figure 6.18(b), 
which shows a minimum conversion loss of 3.79 dB when generating a 
96 GHz output signal. These conversion loss results are measured with only 
—10 dBm input signal over the frequency range of 26-36 GHz. 

The performance of this frequency tripler is summarized in Table 6.3 
along with the performance of some other reported W-band frequency 
triplers. The work in [Yeh13] has also demonstrated an ultra-low power 
frequency tripler using the injection-locking mechanism, but the required 
input signal power can be as high as 6 dBm, which will impose significant 
additional power consumption and design effort on the preceding circuit 
blocks. Looking at Table 6.3, it seems that the design of the frequency 
tripler in this work has proved the potential for utilizing high-speed SiGe 
HBT technology biased in the saturation region to implement a competitive 


Table 6.3 Performance summary of the W-band frequency triplers 


Freq Pin Peak Conv. Harmonic Ppoc 
Reference Technology (GHz) (dBm) Gain(dB) Rejection (dB) (mW) 
[Chen 10] 65 nm CMOS 85-95 .2 0 -13.5 >30 19.8 
[Wang 12] 180 nm SiGe 96 0 -7 >20 75 
[Hung10] 150 nm mHEMT 72-114 14.5 -20 — 120 
[Vish12] 90 nm CMOS 90-115 8 -2 — 17 
[Yeh13] 90 nm CMOS 94 -1 — >20 3 


This work 130 nm SiGe 88.5-103.5 -10 -3.79 >30 4.66 
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W-band frequency tripler while greatly reducing the DC power consumption. 
Also note the relaxed and thus cost-efficient process node (130 nm) of the 
SiGe technology used here. 


6.2 Millimeter-wave and Terahertz Systems 
U. Pfeiffer, R. Jain, J. Grzyb and P. Hillger 


With DOTSEVEN technology it becomes conceivable to realize high-speed 
circuits operating up to fundamental frequencies of 300 GHz and with 
utilization of higher harmonics (sub-harmonic operation) even beyond the 
intrinsic cutoff frequency of the active device. This is the portion of the elec- 
tromagnetic spectrum, where millimeter-wave and terahertz-systems meet, 
and where advanced SiGe HBT technologies have a wide-range potential. 
For instance, the RF bandwidth in communication systems is typically in 
the order of 10% of the carrier frequency, at 300 GHz, this provides a 
wide absolute bandwidth of 30 GHz, enabling data-rates in the order of 
tens of gigabits per second. Similarly, future high-precision radars will profit 
from the abundant bandwidth at frequencies above 200 GHz and terahertz 
3D computed tomography (CT) imagers can be entirely implemented in a 
silicon process technology. The design, simulation, and performance of this 
emerging application space are described in the following. 

The section “240 GHz SiGe Chipset” describes a 240 GHz SiGe chipset 
for ultra-high data-rate communication at frequencies above 200 GHz. The 
high fmax achieved in IHPs DOTSEVEN technology enabled the design 
of high-performance fundamentally operated 240 GHz transmitter (Tx) and 
receiver (Rx) chip-set fully packaged including an on-chip primary antenna 
coupled to a secondary low-loss hyper-hemispherical silicon lens antenna. A 
record data-rate of 40 Gbps for QPSK modulation was demonstrated. 

The section “210-270 GHz Circularly Polarized Radar” describes a 
240 GHz circularly polarized FMCW radar demonstrator in [HPs DOT- 
SEVEN technology. It shows the highest operational bandwidth and range 
resolution reported for any silicon-based radar system. The proposed circular 
polarization concept additionally increases the SNR by 6 dB when compared 
to conventional radar implementations. 

The process improvements in Infineon’s DOTSEVEN technology made 
it possible to implement an all-silicon terahertz 3D imager demonstrator 
presented in the section “0.5 THz Computed Tomography.” The main driving 
motor for this development was to showcase the potential of free-running 
triple-push oscillator source at around 500 GHz for high-quality absorption 
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measurements of hidden objects. The sources have been used together with 
custom asymmetric terahertz detectors to build a 3D terahertz CT system. 
This demonstrator is able to reconstruct 3D volume renders of hidden objects 
with an optically limited voxel resolution of around 2 mm x 2 mm. Contrary 
to previously demonstrated terahertz CT systems that typically use bulky and 
expensive III-V sources, the demonstrator is comprised solely of hardware 
fabricated in SiGe HBT technology from the DOTSEVEN project. 


6.2.1 240 GHz SiGe Chipset 


From an application perspective, the frequency upscaling above 200 GHz 
comes with a lot of benefits. A higher fractional bandwidth and a finer 
diffraction-limited spatial resolution both benefit the numerous applications 
ranging from high-data rate communication, RADAR imaging, and even 
spectroscopic characterization of the materials. The implementation of such 
systems requires wideband RF front-end components and wideband on-chip 
antennas. In this section, we present a generic 240 GHz Tx and Rx chipset 
which was developed under the DOTSEVEN project. This differential chipset 
operates in the quadrature mode, and the frequency of 240 GHz refers to the 
center frequency of the local oscillator (LO) signal, which was designed to 
be very wideband and tunable to make the chipset useful for a plethora of 
applications. 

The block diagrams for both the Tx and Rx are shown in Figure 6.19 
[Sarm16, Sarm16b]. The LO generation network consists of an active balun, 
a x16 frequency multiplier followed by a three-stage power amplifier (PA), 
and a differential 90° hybrid. The active balun is used for single-ended 
to differential conversion of the single-ended low-frequency signal (13.75- 
17.25 GHz) applied from an external frequency synthesizer which drives the 
succeeding x 16 stage. The x 16 frequency multiplier circuit forms the core of 
the LO generation network and it consists of four cascaded frequency-doubler 
stages, which are staggered tuned in frequency to increase the operational 
bandwidth [Sarm13, Sarm14]. The LO signal thus generated is amplified with 
a three-stage PA and then passed through a passive wideband 90° hybrid 
coupler to generate the quadrature signal. In the Tx, the quadrature LO 
signal is mixed with external quadrature IF signal to generate a wideband 
RF which is boosted in power with a four-stage PA and is then radiated 
through an on-chip ring antenna into a hyper-hemispherical silicon lens and 
subsequently to the free space. Similarly, at the Rx, the RF signal travels from 
the lens-antenna to a three-stage PA and subsequently to IF down-conversion 
mixers. The quadrature LO generation network at the Rx is similar to that 
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Figure 6.19 Block diagram of 240 GHz quadrature Tx (a) and Rx (b) chipset with identical 
on-chip ring antenna, after [Sarm16, Sarm16b]. 


of the Tx, and both Tx and Rx use on-chip 50-2 differential buffers for the 
external IF interface [Sarm16b]. The bandwidth is further optimized with a 
specially designed high-speed printed circuit board (PCB), which will also 
be discussed later. Each of these circuit blocks is discussed individually in 
the subsequent sections. We also demonstrate an end-to-end communication 
system with a measured data rate of 30 Gbps with an EVM of 26% and 50 
Gbps with an EVM of 29% for BPSK and QPSK modulation respectively, 
without applying any channel equalization or error correction techniques 
[Pedro17, Grz17b]. 


6.2.1.1 Wideband LO signal generation 

Let us start discussing the design details for this chipset, starting with the 
LO generation circuitry which forms the core for both the Tx and the Rx. 
In this design, the frequency multiplication technique is used instead of 
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a HF voltage-controlled oscillator (VCO) for LO-signal generation above 
200 GHz. This preference was made based on the following reasons: 


1. Frequency multipliers offer higher tuning range, higher usable band- 
width, and a flexible phase noise performance compared to the VCOs. At 
high-frequencies, the overall VCO tuning range is limited by the vector 
parasitics [Chil13]. 

2. A wideband tunable LO is also needed to realize a generic chipset 
which can be used across a spectrum of applications such as high-speed 
communication, material characterization, imaging, and frequency- 
modulated continuous wave (FMCW) RADAR [Sarm16, Grz16]. 

3. The often stated and major drawback of multiplier chains is that they 
have an overall higher power consumption. However, VCO-based LO 
sources also need additional frequency dividers, which increase their 
overall power consumption as well. A free running VCO is otherwise 
limited to the on-off keying (OOK) modulation with a poor spectral 
efficiency [Sarm14]. 


An expanded block diagram of the LO generation network is shown in 
Figure 6.20. The x16 multiplier chain is composed of four cascaded doubler 
stages, each of which is based on the common Gilbert-cell topology where 
the RF and LO ports are supplied with the same signal for in-phase multi- 
plication to extract the second harmonic. The three-stage PA is composed of 
pseudo-differential cascode topology which shall be discussed later. 

The LO generation circuitry for a generic transceiver chipset must fulfill 
the following performance requirements: 


1. To ensure the generic nature of the chipset, the LO signal must have a 
large bandwidth. While a system based on fixed or narrowband LO along 
with wideband mixers can amplifiers is suitable enough for communi- 
cation applications, other applications such as FMCW radar require a 
wideband tunable LO source [Grz16]. 
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Figure 6.20 Block diagram of LO signal source consisting x16 frequency multiplica- 
tion over four cascaded frequency doublers (D1—D4) and a wideband three-stage PA, after 
[Sarm 16]. 
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2. As mentioned above, the power-hungry nature of the multiplier chains 
is often a major concern. The design must limit the power dissipation of 
the multiplier chain to as low as possible. 

3. As the LO generation is based on harmonic extraction; the spectral purity 
of the multiplier chain is very important. Any spurious tones from the 
doubler stages reaching the mixer may start corrupting the IF thereby 
limiting the IF bandwidth. 

4. The mixers need a minimum LO drive power of around 1 mW (0 
dBm). Therefore, the generated LO signal power must be high enough to 
manage this power level along with the additional losses in the passive 
hybrid coupler. 


For these reasons, the LO generation sub-system was designed to generate 
a power of at least 5 dBm over a 3 dB bandwidth of 40 GHz [Sarm14]. To 
understand the design further, we need to have a look at the circuit description 
of each component individually. 


(A) x16 frequency multiplier 

(i) Gilbert-cell frequency doubler 

The circuit schematic for Gilbert-cell based unit doubler stage is shown in 
Figure 6.21. This topology is chosen due to an inherent differential operation 
and a high conversion gain (CG) as compared to a conventional class-B bias 
multiplier topology [Sarm11, Hung05, Oje11]. The capacitance Cin couples 
the differential input signal from the transconductance stage (Q1, Q2) to the 
switching quad (Q3-Q6). 

The inductors Lẹ and Le are part of the input and output matching 
networks. These are implemented on-chip with shielded microstrip lines 
in the top most metal layer with lengths /, and le, respectively. Shielded 
microstrip lines limit the electric field coupling between different parts of 
the circuits. The values of the matching network elements are also provided 
in Figure 6.21. As the input and output of each doubler stage from D1 to D4 
progressively shift to higher frequency, the design of each stage is optimized 
along the following guidelines [Sarm14, Sarm16]: 


1. The early stages D1 and D2 operate at lower frequencies and therefore 
the effect of parasitics is less pronounced. This allows for saving of 
some chip area by omitting the bias inductor L», entirely in favor of a 
resistor Ry. 

2. The transistor stages D1 + D2 are optimized for a high CG, which 
allows them to operate with a lower LO input power. This minimizes 
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Figure 6.21 Schematic of the Gilbert-cell doubler stage for frequency multiplication. The 
table mentions the passive values and the lengths l» and le for microstrip lines used to 
implement inductors Lp and Le, respectively. For D1 and D2, the base tuning inductance, 
after [Sarm16]. 


the LO leakage associated with the inherent asymmetry of Gilbert-cell 
based frequency doublers, which may otherwise produce the spurious 
harmonics at the output of the multiplier chain. 

3. The transistor sizing is determined at D4. The maximum transistor size 
is limited to 4 x (0.96 x 0.12) jum? as any further scaling will require 
an accurate synthesis of a very small Le (<10 pH) which is very difficult 
for an on-chip BEOL environment. 

4. For all the doubler stages, the transistor sizes are kept constant (same as 
D4) to maintain a sufficient interstage drive power. The large transistors 
benefit the stages D1 and D2 as they lower the inductance L, required to 
tune out transistor parasitic capacitance, saving further chip area. 


(ii) Interstage matching network 
The design of interstage matching network among the stages D1—D4 is very 
crucial for achieving the desired wideband operation. The matching network 
must be tuned to the second harmonic of interest from the preceding stage for 
a doubler operation. Also, higher order even harmonics must be sufficiently 
attenuated; otherwise, they would exist in the pass-band of subsequent stages. 
Another concern is the center frequency alignment between the stages. If 
the center frequencies of stages D1—D4 are perfectly aligned to consecutive 
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second-order harmonics with similar relative bandwidth, then the overall 
multiplier chain frequency roll-off becomes much sharper (as in the case 
of higher order filters) and thus the net bandwidth is reduced. The overall 
bandwidth BW overal of N cascaded stages is related to the bandwidth BW 
of single stage as BW overall = BW x vy 21/N — 1 [Ana04]. This implies 
that for a four-stage network, the overall 3 dB bandwidth corresponds to a 
mere 0.75 dB bandwidth of the individual stages, or the 3 dB bandwidth of 
the individual stages equates to the overall 12 dB bandwidth. 

One trick to mitigate this limitation is to use a staggered frequency tuning, 
where the stages are deliberately misaligned for an overall smoother fre- 
quency roll-off [Sarm13, Sarm14]. The detailed interstage matching network 
used in this design is shown in Figure 6.22. The doublers D1 and D3 are 
tuned higher while D2 and D4 are tuned lower and this resulted in a much 
smoother roll-off beyond the 3 dB point. As shown in Figure 6.23, the peak 
CG at the output of D1, D2, D3, and D4 are at the frequencies of 35, 55, 130, 
and 230 GHz respectively. For D1-D2 and D2-D3, the interstage matching is 
such that the optimum impedance is transformed at the output of D1 and D2 
at the second harmonic of interest. Additionally, it ensures that the impedance 
is low at the fourth and eighth harmonics for D1 (passband of D3 and D4) and 
at the fourth harmonic for D2 (passband of D4). For this design, the D3 output 
and the D4 input were matched to a 100-Q differential impedance for the ease 
of breakout characterization and interfacing with other circuits. The simulated 
(large signal) output impedance at the output of each doubler considering the 
loading of the succeeding stages is shown in Figure 6.24. The low impedance 
at the undesired harmonics ensures sufficient harmonic rejection. The stagger 
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Figure 6.22 Interstage matching between the doubler stages of the multiplier chain for LO 
generation. The tuning inductance Le connected to the collector output is not shown, after 
{Sarm16]. Other parameter values are mentioned in the table in Figure 6.21. 
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Figure 6.23 Simulated: (a) output power and (b) CG of the individual doubler stages. The 
doublers D1 and D3 are tuned higher, while D2 and D4 are tuned lower, and this resulted in 
an overall flat response, after [Sarm16]. 
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Figure 6.24 Simulated impedance at the collector outputs of D1-D4 derived from the large 
signal S-parameter simulations, after [Sarm16]. 


frequency tuning between the stages resulted in an overall simulated 3 dB 
bandwidth of 50 GHz (210-260 GHz) with a peak output power of —2.8 dBm 
at 240 GHz. The overall multiplier chain along with the active balun at the 
input consumes about 720 mW of power [Sarm16]. 


(B) Power amplifier (PA) 

The x16 frequency multiplier is cascaded with a three-stage PA. A detailed 
schematic of the single stage of this PA is shown in Figure 6.25. The circuit 
architecture is based on pseudo-differential cascode amplifier. The cascode 
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Figure 6.25 Schematic of the three-stage PA. The transistors Q1—Q4 have an emitter area of 
8 x (0.96 x 0.12) wm”, after [Sarm16]. 


topology is a popular choice in high-frequency amplifier design. The use of 
common-base stage as a load to the common-emitter transistor in a cascode 
reduces the Miller capacitance, therefore improving the reverse isolation, 
stability and the ease of impedance matching. Also, the voltage swing for the 
PA is limited by the base—collector breakdown voltage (BVcgo), which is 
larger than the collector-emitter breakdown voltage (BVcgo) in a common- 
emitter configuration. A higher voltage swing allows for a higher saturated 
output Psat from the PA [Kerh15]. 

A general design outline for the PA is provided in [Sarm13b]. The differ- 
ential configuration leads to a virtual ground at the base of the common-base 
stage of the cascode amplifier, which enables efficient and compact on-chip 
layout by relaxing the need for extensive on-chip decoupling capacitors. 
However, while a differential topology is expected to have a good common 
mode rejection, the design of a true differential amplifier requires an active 
tail current source. At frequencies reaching 200 GHz, the impedance of such 
current source becomes very low rendering it ineffective. Inductor-based 
current sources are also challenging as their low self-resonance frequency 
(SRF) limits the maximum synthesizable inductance, and the quarter wave 
transmission line-based inductors are inherently narrow band. Therefore, in 
this design, a pseudo-differential architecture is used where the common- 
emitter terminal is grounded and the base biasing is provided through the 
current mirrors. It becomes very challenging to implement the switching PAs 
at frequencies extending beyond 100 GHz due to the transistor parasitics 
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[Song15]. In this design, a low power gain of the device at the high oper- 
ating frequency (beyond fmax/2) necessitates the use of class-A biasing for 
the PA. 

In Figure 6.25, the emitter area of each of the transistors Q1—Q4 is 
8 x (0.96 x 0.12) um?. The emitter area and subsequently the power gain is 
therefore limited by the device parasitics. For larger device size, the required 
matched tuning inductor becomes too small (less than 10 pH) for on-chip 
implementation. The microstrip line-based inductor TL1, capacitor C1, and 
the coupled microstrip line CLIN2 are part of the output match, while CLIN1 
is part of the input match [Sarm16]. A decoupling capacitor of 100 fF is used 
at the common base of transistors Q3—Q4 (not shown here) [Sarm16c]. 

To maximize the power, the optimum load impedance at the collector 
node (Ropt) must ensure a simultaneous maximization of the voltage and 
current swing. This can be derived from the loadline analysis and depends 
on the breakdown voltage and the maximum allowable current density [Ref]. 
The output resistance of the cascode Ro should also be as high as possible to 
maximize the power delivered to the load [Ref]. However, due to the internal 
device parasitics, even when the reactance at the output node is tuned out, 
the output resistance shows a sharp reduction with frequency (Ro x 1/f?) 
as shown in [Sarm13b]. In this case the loadline impedance match becomes 
inefficient and therefore instead a conjugate matching is used in this design. 

For the multistage PA design, the device sizing is generally scaled from 
input to the output stages for handling progressively increasing RF power 
levels. However, this also requires a modified interstage matching network for 
each subsequent stage. In this design, the transistor sizes for all the stages are 
kept identical, which provides the flexibility to cascade multiple stages based 
on the gain requirements without a need for altering the interstage matching 
network. Identical stages also reduce the probability of frequency misalign- 
ment between different stages. Note that a four-stage variant of this PA is 
used in the Tx after the up-conversion mixer to increase the transmit power. 
For the interstage matching, the capacitor-coupled LC resonator technique 
is used [Shek06, Ana04, Nel32]. Here, the coupling capacitor C1 between 
the stages introduces an additional zero in the passband which improves the 
overall bandwidth. 

The multiplier chain with the three-stage PA was characterized separately 
in a breakout structure using WRO03 220-325 GHz ground-signal-ground 
(GSG) waveguide probes along with an on-chip wideband (210-280 GHz) 
Marchand Balun at the output. A DC-40 GHz GDG probe was used to 
provide a low-frequency (<20 GHz) input signal using an external frequency 
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synthesizer. An Erickson Calorimeter equipped with a WRO03 waveguide 
taper was used for the absolute output power measurement for two different 
input LO power levels (-10 dBm and 0 dBm), and the results are shown in 
Figure 6.26 [Sarm14, Sarm16c]. For a -10 dBm input LO power, the peak 
output power is 6.4 dBm at 230 GHz, and the 3 dB RF bandwidth is 50 GHz 
(215-265 GHz). The output power remains constant for an input LO power 
of 0 dBm below the input frequency of 13.75 GHz due to some spurious 
harmonic generation. The LO generation network also shows a phase noise 
degradation of around 25 dB and consumes about 0.74 W of DC power. 


(C) On-chip wideband quadrature coupler 

For the quadrature operation, the LO form the PA is provided to a 3 dB 90° 
coupler. A simplified geometry of the on-chip quadrature coupler is shown in 
Figure 6.27. It is implemented using three buried metal layers. The coupler 
exploits a combination of broadside coupling between the strip conductors 
located on different metallization layers and edge coupling between adjacent 
strip conductors located on the same layers. The design is sized to mini- 
mize the propagation loss and equalization of the propagation speed for all 
propagating modes to ensure maximum operation bandwidth. The coupler 
operates along with differential 100-Q grounded coplanar stripline feeds 
implemented on a thick top metal layer. The coupler was originally designed 
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Figure 6.26 On-chip power measurements for the standalone LO generation source with an 
Erickson calorimeter for input LO power of —10 dBm and 0 dBm, after [Sarm16c]. 
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Figure 6.27 Simplified metal-level multi-layer geometry for the differential quadrature 
coupler, after [Grz15]. 


for a duplex FMCW radar chipset [Grz15], and unlike the conventional cou- 
plers, the required circuit connections here favor the placement of Through 
and Coupled ports on the same side. Therefore, the EM structure is carefully 
optimized to minimize the layout asymmetry. 

When employed in the transceiver chipset, one of the input ports of the 
coupler is terminated with a matched 100-2 differential load impedance. This 
does not result in any additional losses, as the isolation between the input 
ports is more than —23 dB. The input return loss is less than —26 dB for a wide 
160-340 GHz bandwidth (Figure 6.28). The simulated phase and amplitude 
imbalance between the quadrature output ports within this frequency band 
are better than 3° and 1.6 dB respectively. 
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Figure 6.28 Simulated input match at all four ports of the quadrature coupler and isola- 
tion between the input ports for a differential excitation. All ports are referred to a 100-Q 
differential impedance, after [Grz15]. 
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6.2.1.2 Transmitter building blocks 

Other than the common LO generation network, an up-conversion mixer at 
the Tx converts an external IF signal into the RF signal which is then boosted 
with a four-stage PA before radiating through the antenna. The relevant design 
criteria for the Tx thus are wideband RF and IF operation, as well as sufficient 
output power. 


(A) Up-conversion mixer 

The up-conversion mixer is based on the double-balanced Gilbert-cell topol- 
ogy due to its inherent differential operation, LO rejection, and high CG 
[Voin13]. In the circuit shown in Figure 6.29, the switching quads (Q1-Q4, 
Q7-Q10) are driven by the quadrature LO signal while the transconductance 
stages (Q5—Q6, Q11—Q12) are driven by the quadrature IF signal. The center 
taps for both inductors Lp and Le are used for the DC biasing, where Lp 
forms the part of LO matching network. Both the inductors Lp and Le 
are implemented in TM1 (second from the top) metal layer as microstrip 
transmission lines [Sarm1 6c]. 

Since the LO drive power is limited, minimum-sized transistors were 
used at the switching quad stages to allow for a stronger switching and to 
reduce the parasitic capacitances. Here, the transistors with emitter areas 
2 x (0.96 x 0.12) um? and 1 x (0.96 x 0.12) um? were used for the 
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Figure 6.29 Up-conversion mixer schematic with additional buffer stages for wideband 
50-Q IF matching, after [Sarm16b]. 
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transconductance stages and the switching quads, respectively. Also, buffer 
stages with a shunt resistance of 50 Q were added for a wideband match to 
the external IF. For these buffer amplifiers, the transistors with an emitter area 
of 4 x (0.96 x 0.12) um? are used for linearity reasons. 

Note that the mixers are not characterized as separate breakouts, and it 
requires high-power LO above 200 GHz from external signal sources. To 
simulate this mixer, a 25 MHz, -5 dBm signal was applied at one of then 
IF channels. For a 240 GHz LO with 5 dBm power at the input of the 90° 
coupler, the simulated peak CG is 0.7 dB, Psat is -5 dBm, and OPjgp is 
-8 dBm. The simulated 3 dB IF and RF bandwidths are 38 GHz and 
50 GHz, respectively [Sarm16c]. 


(B) Four-stage PA 

The four-stage PA extends on the three-stage PA used in the LO generation 
network, with one identical additional stage. This PA was characterized as 
a separate breakout. At 230 GHz, the measured peak small-signal gain is 
26 dB, 3 dB RF bandwidth is 28 GHz, and S11 < —10 dB between 215 
and 255 GHz. For large signal measurements at 240 GHz, the PA provides a 
gain of 12.5 dB at compression, and the 1 dB compression points for input 
and output are -16.5 dBm and 3.7 dBm, respectively. Also, the measured 
Psat > 6 dBm for the 220-260 GHz frequency range. Note that since the 
drive power of Tx PA is large, the large signal bandwidth is more applicable 
to communication links, and this is usually larger than the small signal band- 
width. The measured peak power-added efficiency (PAE) at 240 GHz is 1% 
[Sarm 14]. 


6.2.1.3 Receiver building blocks 

Noise is the primary concern for the Rx sensitivity. Therefore, along with a 
wideband IF and RF, a low NF at the Rx is very much desirable. At the Rx, the 
three-stage variant of the PA is used as a preamplifier. The center frequencies 
of both the three-stage and four-stage PAs are similar, and therefore the 
probability of frequency misalignment between the Tx and Rx becomes very 
low [Sarm16b]. Also, the small signal bandwidth of the preamplifier decides 
the Rx bandwidth when it is driven with a low power signal (as in the case 
of a communication system with Tx and Rx separated over a wide distance 
resulting in a large path loss). A three-stage PA shows a larger small signal 
bandwidth as compared to the four-stage PA and therefore it is preferred at 
the Rx, ensuring a wideband operation. 
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(A) Down-conversion mixer 

The down-conversion mixer at the Rx end, also implemented with a double- 
balanced Gilbert-cell topology, is shown in Figure 6.30. The RF current at 
the transconductance stage (Q9-Q10) is fed by the input RF signal through 
the antenna and the pre-amplification PA, and this is shared between the I 
and Q switching quads (Q1-Q4 and Q5-Q§8, respectively), which in turn 
are supplied with the quadrature LO signal from the on-chip wideband 90° 
coupler. Common-emitter buffers with a series resistance of 50 2 are added 
at the IF outputs for a wideband match. The transistor sizes for the mixer 
and the buffers are identical to those of the up-conversion mixer at the Tx. 
The choice of load resistor Re determines the trade-off between the CG and 
the RC time constant-limited IF bandwidth at the collector output. The load 
resistor of 200 Q used in this design corresponds to a simulated 3 dB IF 
bandwidth of 35 GHz and a peak CG of —0.5 dB. For a 33 MHz IF and 240 
GHz, 5 dBm LO at the input of the quadrature coupler, the simulation predicts 
a 3 dB RF bandwidth of 52 GHz and a minimum NF of 14.2 dB. Also, both 
the simulated minimum LO power and IP; gp are around —2.5 dBm [Sarm16b, 
Sarm1 6c]. 


6.2.1.4 Antenna design 
The linearly polarized on-chip antenna in the Tx and the Rx chipset is topo- 
logically similar to the differential wire ring topology [Grz12]. It consists of 
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Figure 6.30 Schematic for the down-conversion mixer. Here, additional buffer stages were 
added to have 50-2 input impedance required for wideband IF matching, after [Sarm16b]. 
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two wire semi-rings connected along the center feed. For wideband operation, 
the feed is non-uniformly tapered using step-wise approximation [Grz17]. It 
is designed to illuminate a 9 mm-diameter silicon hyper-hemispherical lens 
through the chip backside. The lens reduces the influence of surface waves 
on the radiation efficiency and radiation patterns and inherently delivers 
a high gain to compensate for the high free-space propagation loss. The 
backside radiation offers significant advantages over the front-side radiation. 
The bandwidth is no longer limited by the distance of the ground plane 
(few um) as in the case of front-side radiation. The form-factor reduction 
is by a factor of 3.3 (,/11 for silicon), which is 39% less than in the case 
of front-side radiation (,/3.9 for silicon-dioxide). Moreover, the ability to 
mount external silicon lens of different sizes gives the ability to have flexible 
application specific directivity. The lens extension is chosen to be close to 
the elliptical position with extension to radius ratio of 34.4%. The antenna 
provides a differential impedance of 100 Q over a very wide bandwidth 
(S11 < -20 dB over 180-330 GHz) [Sarm16]. The simulated cross- 
polarization is below 20 dB for differential operation. By providing a low 
impedance (4-5 Q) for the common mode, radiation from the parasitic 
common-mode signal is minimized. It is also optimized for the minimization 
of mode conversion (differential to the common mode) and the simulated 
mode conversion is below 40 dB. The overall directivity of the antenna with 
the lens is 26.4 dBi at 240 GHz. 


6.2.1.5 Packaging and high-speed PCB design 

The chip-on-board (COB) technology is used for both Tx and Rx packaging. 
The entire chip-on-lens assembly is accommodated inside a recess on a PCB, 
and chip pads are connected to the PCB bond pads through the wirebond 
process. A heat sink with direct thermal contact to the silicon lens is also 
added to improve the heat dissipation away from the chip (Figure 6.33). 

The high-frequency IF signal is the major concern while designing the 
PCB. The PCB material should have a low dispersion and low dielectric 
loss. Therefore, materials with low relative permittivity and low loss tangent 
are favored. Here, the ROGERS 4350B material from Rogers Corporation is 
used. This material is designated for high-frequency applications and it shows 
a permittivity €p and a loss tangent tand of 3.66 and 0.0037, respectively 
over DC-20 GHz frequency range. The choice of PCB thickness is a trade-off 
between the mechanical stability and the maximum allowed line width for 
the lowest impedance microstrip lines. Since the differential and quadrature 
IF routing lines must be highly symmetrical, very large trace widths cannot be 
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tolerated within a reasonable PCB area. Here, a PCB thickness of 
0.388 mm is found to be optimum, with a 50-Q microstrip line corresponding 
to a 0.718 mm width. 

The wirebond connecting the chip to the PCB also limits the IF band- 
width. Therefore, a phase linear wideband matching filter needs to be 
implemented on the PCB. While the Bessel filters show the most linear 
phase response, the feasible component values limit the choice to maximally 
flat or Butterworth filter topology which still provides a more linear phase 
response as compared to the Chebyshev filters. For this purpose, an 8-section 
lumped parameter LC filter is synthesized using Richard transformation on an 
iterative basis [Pozar09], which takes into account the wire bond inductance 
(Figure 6.31) [Sarm16b]. 

Figure 6.32 shows the results from the full EM simulation of the filter. 
For this, the PCB and the chip ground pads are included to accurately model 
the ground return current. This simulation predicts an insertion loss ($21) of 
—0.5 dB in the passband with a 3 dB bandwidth of 15 GHz. The input return 
loss (S11) is less than — 10 dB for up to 14 GHz and the group delay variation 
is less than 10% up to 9 GHz. 


6.2.1.6 Tx and Rx characterization 
The chip-micrograph of the Tx and Rx chipset with the on-chip antenna 
is shown in Figure 6.34 [Sarm16b]. For on-wafer characterization, the Tx 
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Figure 6.31 Schematic of the low-pass filter implemented on a ROGERS 4350B PCB 
material with a thickness of 0.338 mm. The stepped impedance filter low-pass filter is 
implemented with microstrip lines on the PCB, after [Sarm16b]. 
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Figure 6.32 The full-EM simulation results from the filter, after [Sarm16c]. 
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Figure 6.33 Lens mounted and packaged chip for Tx/Rx module. 


and Rx have an auxiliary balun instead of an on-chip antenna, and are not 
shown here. A WRO3 GSG waveguide probe for the 220-325 GHz band is 
used to measure the RF output from the Tx and to supply RF to the Rx. 
Using the Short-Open-Load (SOL) calibration, the loss due to the probe and 
the waveguide is estimated to be 7.5 dB in this band. A GSG probe (DC- 
40 GHz) is used to couple the low-frequency LO signal with —10 dBm 
output power to the chip. A 25 MHz signal from a function generator is along 
with external 90° and 180° hybrids are used for the differential quadrature IF 
signal generation at the Tx side, and the output RF power is measured using 
an Erickson calorimeter. For the Rx characterization, a WRO3 VNA extension 
module is used in the transmit mode as an RF source, and its output power 
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Figure 6.34 Chip-micrograph of the Tx and Rx chipset. The total chip area including the 
pads is (a) Tx: 1.613 mm? (b) Rx: 1.522 mm?. For the on-wafer measurements, an auxiliary 
balun with an estimated 2.5 dB loss has been added at the output, after [Sarm16b]. 


is calibrated using the Erickson calorimeter. The down-converted IF power at 
33 MHz is measured with the spectrum analyzer. 

The on-wafer characterization results are shown in Figure 6.35 
[Sarm16b]. The Tx can deliver up to 6 dBm output power at 240 GHz and 
the 3 dB bandwidth is 40 GHz. The peak CG for the Rx is 11 dB and the NF 
is 15 dB while the 3 dB RF bandwidth is 28 GHz. The NF is calculated using 
the direct method [Oje12] under the assumption that the input noise floor is 
—174 dBm/Hz (thermal noise at the room temperature). 

For the IF bandwidth characterization, the packaged Tx and Rx (with on- 
chip antenna and the lens) are placed back to back with a distance of 90 cm. 
The IF inputs of the Tx and the Rx are connected to a VNA and swept IF 
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Figure 6.35 On-chip characterization results for (a) the Tx, and (b) the Rx, after [Sarm16b]. 
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measurements are done for a fixed LO input of 240 GHz. The results indicate 
a 6 dB IF bandwidth of 13 GHz as shown in Figure 6.36 [Sarm16b]. 


6.2.1.7 Ultra-high data rate wireless communication 

The measurement setup for ultra-high data rate wireless communication 
system is shown in Figure 6.37 [Pedro17, Grz17b]. The fully integrated and 
packaged Tx and Rx modules were separated by a link distance of 1 m. The 
differential inputs of the Tx are connected to an arbitrary waveform generator 
(Tektronix AWG70001A) and the IF outputs of the Rx are connected to a real- 
time oscilloscope (Tektronix DPO77002SX) using phase-matched cables. 
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Figure 6.36 Measurement results from the IF bandwidth characterization over a link dis- 
tance of 90 cm. For this measurement, the LO is fixed at 240 GHz and the measured 6 dB IF 
bandwidth is 13 GHz, after [Sarm16b]. 
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Figure 6.37 Measurement setup for the high data rate wireless communication with arbi- 
trary waveform generator (Tektronix AWG70001A) and real-time oscilloscope (Tektronix 
DPO77002SX), after [Pedro17]. 
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The external LO inputs for both Tx and Rx are driven by the same 15 GHz 
signal from an external synthesizer with a power splitter. 

With no channel equalization applied, maximum transmission speeds 
of 30 Gbps with an EVM of 26% and 50 Gbps with an EVM of 29% 
were demonstrated for BPSK and QPSK, respectively, using PRBS9 binary 
sequence. To increase the reliability of the link with a smaller EVM, a second 
test was performed for reduced transmission speeds, resulting in an EVM of 
11% for 25 Gbps and an EVM of 22% for 40 Gbps for BPSK and QPSK, 
respectively (Figure 6.38). The limitations in the board bandwidth as well 
as in the Rx RF/LO bandwidth influence the achievable EVM for the tested 
modulation speeds. 


6.2.2 210-270 GHz Circularly Polarized Radar 


Contrary to other imaging techniques, high-resolution radar-based imagers 
are capable of providing significant improvements in the imaging quality 
thanks to their range-gating capabilities [Coop08, Lian14, Dick04, Quas09, 
Graj15]. Similar to general-purpose transceivers operating beyond 200 GHz, 
they feature low integration and are thus not commonly used because they 
become expensive and space-inefficient [Coop08, Esse08, Bryl13]. Con- 
sidering the increasing popularity of radar sensors in various high-volume 
consumer and industrial markets such as health care [Lil3], autonomous 
navigation in robotic platforms [Chen08, Moal14], non-destructive testing 
[Karp05], and automotive systems [Maur11], the implementation costs with 
low weight and small form-factor are more and more relevant. By suitable 


Figure 6.38 Measured eye diagrams for: 25 Gbps BPSK modulation (left); 40 Gbps QPSK 
modulation (right), after [Pedro17]. 
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combination of microelectronic packaging with silicon technologies, high- 
integration levels of the complete radars become a reality and will develop 
in the future into the solution of choice for such sensors. Currently, most of 
Si-integrated radars are operated below 100 GHz [Shen12, Maur11] in view 
of the technology limitations. 

Within the frame of DOTSEVEN project, a complete highly integrated 
FMCW homodyne monostatic radar system operating around 240 GHz with 
a 60 GHz bandwidth and a state-of-the-art 2.57 mm-range resolution was 
developed. Its RF front-end is implemented in the form of a single chip 
in a 0.13-um SiGe HBT technology with fr/fwax of 300/450 GHz from 
IHP. To facilitate a low-cost packaging scheme, the chip further includes 
a wideband lens-integrated on-chip annular-slot antenna [Grz15, Grzyb15] 
and is wire-bonded onto a low-cost FR4 printed-circuit board. Despite the 
expected lower sensitivity [Graj15] of the homodyne monostatic architecture, 
this radar topology was selected for implementation due to low costs of the 
accompanying baseband chain and the highest possible integration level on 
a single chip. As opposed to classical linearly polarized monostatic radar 
front-ends [Jahn12, Jaes13, Jaes14], this radar employs circular polarization 
[Kim05, Statn15] for multiple reasons. The first reason is the absence of on- 
chip circulators separating Tx and Rx paths. This issue is typically solved 
by using equivalent quasi-circulators made of on-chip directional hybrid 
couplers such as rat-race [Jahn12]. Such a solution suffers from an excessive 
6 dB loss in SNR because of some additional power loss in the terminating 
loads [Jahn12, Kim05]. As shown in [Statn15], this loss can be gained back 
by means of a circularly polarized architecture. Circular polarization may 
further increase detection probability in the presence of wave depolarization 
[Moal14, Nash16] or reduce the influence of Rx jamming while operating 
multiple radar sensors simultaneously [Lian07] and of ghost targets in indoor 
environments [Moal14]. 

Figure 6.39 presents the radar chip micrograph and its block diagram. Cir- 
cular polarization is provided by a broadband annular-slot antenna supporting 
two orthogonal polarizations [Grz15, Grzyb15] driven from a wideband 
quadrature coupler [Grz15]. The transceiver is implemented in a fully dif- 
ferential configuration. To achieve the fundamental radar operation in a wide 
frequency range of 210-270 GHz, the LO-generation path is realized with the 
x16 multiplier-chain architecture because of the missing appropriate tuning 
varactors devices. Both Tx and Rx paths share the same LO-generation chain 
which is driven around 13.1-16.9 GHz at a power level of around 0 dBm. 
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Figure 6.39 Radar transceiver chip implemented in 0.13-um SiGe HBT technology and 
operating at 210-270 GHz: (a) chip micrograph, (b) block diagram; after [Stat15]. The chip 
size is 2.9 mm x 1.1 mm. 


The LO drive is provided from the printed circuit-board level as a single- 
ended signal which is then converted to a differential topology by means 
of an active balun in front of the multiplier chain. The x16 multiplication 
factor was selected in view of the limited RF performance of the regular 
mm-long wire-bonded interconnects. The multiplier chain comprises four 
cascaded Gilbert-cell frequency doublers [Sarm16] which inherently provide 
differential operation. The output signal from the LO-path is equally split 
by a novel differential Gysel power divider to drive both Tx and Rx paths. 
Compared to the Wilkinson divider, the chosen Gysel power-splitter is capa- 
ble of providing an improved isolation between its two output ports at the 
operation frequency. Each of two outputs drives a four-stage power amplifier 
with a small-signal gain of 14 dB and a Psat of around 7 dBm [Sarm16]. 
One of the amplifiers is connected directly to the Tx port of the circularly 
polarized antenna whereas the other drives the down-conversion mixer. Due 
to the similar impedance range at the inputs of both amplifiers, the power 
splitter imbalance can be minimized. Furthermore, the power amplifiers are 
useful in providing an improved TX-to-RX isolation from the LO-chain side. 
The down-converting mixer is operated fundamentally and implemented as 
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a double-balanced Gilbert-cell topology. In order to minimize the influence 
of excessive mixer noise on the Rx NF, the receive signal from the antenna 
output port is pre-amplified with a three-stage PA. Here, the power amplifier 
was used instead of a regular LNA [Statn15] to maximize both the radar 
operation bandwidth and the linearity with similar noise performance metrics 
to that achievable with a silicon-integrated LNA at the operation frequency 
[Statn15]. Please note that the Rx linearity is crucial for the radar operation 
because of its monostatic architecture suffering from the TX-to-RX leakage. 
From previous measurements of the similar Rx paths [Sarm16], an input- 
referred 1 dB Rx compression point of around —9 dBm sets a reference 
value for finding the minimum required antenna input match to avoid Rx 
compression. More advanced adaptive leakage power cancellation techniques 
[Brook05, Beas90] can be considered in the future to improve the radar 
performance. 

For free-space operation, the transceiver chip is mounted on the back of 
a high-resistivity hyper-hemispherical silicon lens with the primary on-chip 
feed antenna aligned with the lens optical axis. Then, the entire chip-on-lens 
assembly is in turn mounted in a recess of a regular FR-4 PCB surrounded 
by a large metal plane, as shown in Figure 6.40. The lens volume is crucial 
for thermal control for the chip dissipating around 1.6 W. Here, the heat is 
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Figure 6.40 Complete radar transceiver module with a copper heat sink and a 9 mm silicon 


lens; after [Sta15]. The incorporated IR-image indicates that the chip-on-lens assembly is at 
around 29°C. 
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transferred to a copper heatsink through the lens attached to the PCB bottom 
side which stays in direct contact with the lens. The lens further allows an 
in-door operation of the complete radar module with no additional external 
optical components because of a significant increase of the antenna effective 
gain. Moreover, the pointing-direction errors present in a lens-integrated 2- 
antenna system (bistatic radar) radiating at an angular offset from the lens 
optical axis are eliminated in the monostatic radar architecture relying on 
a single on-chip antenna aligned with the lens center. The current radar 
implementation features a 9 mm-diameter hyper-hemispherical lens with a 
1.3 mm extension to maximize the antenna directivity [Fili93]. Such a lens 
size provides enough volume for cooling the chip to 29°C, as shown in 
Figure 6.40. 

The integration of a high-performance antenna on a silicon chip is one 
of the most challenging tasks because the typical cross section of a silicon 
chip comprises only few metal layers embedded in a low-refraction-index 
BEOL (Back-End-of-Line) dielectric stack, typically only few micrometers 
thick, which is located on the top of a lossy bulk silicon substrate. With 
such a dielectric stack, there are basically only two options for implementing 
on-chip antennas. The first and the most straightforward approach is to 
use a ground-plane support between the BEOL stack and the substrate to 
realize the classical microstrip-type antenna radiating to the top of a silicon 
chip. This solution, however, results in low radiation efficiency and narrow 
operation bandwidth [Jaes13, Jaes14]. With the ground-support eliminated, 
electromagnetic waves start penetrating the complete volume of a lossy and 
electrically thick substrate launching various parasitic modes such as surface 
waves. This leads to a very poor prediction of radiation characteristics over 
a large RF bandwidth, parasitic inter-element coupling, and low radiation 
efficiency. An alternative solution is to apply the so-called lens-integrated 
on-chip antennas for mm-wave and THz applications [Fili93, Jhal4] which 
was the preferred option also for this work. 

For this particular case, an isolation between the transmit path and the 
receive path over a wide operation frequency range is additionally required 
which is provided by a circularly polarized antenna. Circular polarization is 
achieved by suitable combination of a broadband differential quadrature cou- 
pler [Grz15] and a novel dual-polarization circular-slot antenna [Grzyb15]; 
as shown in Figure 6.41. Here, a left-handed polarization (LHCP) is imple- 
mented in the Tx path (‘Tx out’ in Figure 6.41), whereas the receive signals 
reflected in free-space at an odd number of times are directed to the receive 
port (‘Rx in’ in Figure 6.41) as RHCP waves. 


6.2 Millimeter-wave and Terahertz Systems 279 


TX Port RX Port 
PCB with Radar 


Chip om 


-S 


9-mm Silicon Lens 


_—— Vacuum Surrounding 
3-D Simulation Model 


3-D EM simulation model of the packaged radar 


Figure 6.41 A 3-D EM simulation model of the packaged radar module with a silicon chip 
mounted on the back of a 9-mm lens; after [Sta15]. The chip-on-lens assembly is placed inside 
a rectangular recess in the PCB and surrounded by a large ground plane. The slot antenna with 
the differential quadrature coupler in the BEOL dielectric stack of a silicon chip is shown in 
the magnified view. The transmit port and the receive port are denoted as “Tx out’ and ‘Rx in’, 
respectively. 


The implemented slot antenna is capable of supporting two orthogonal 
polarizations launched by two orthogonal pairs of patch probes located along 
the slot circumference (inset of Figure 6.41). The antenna is embedded in 
a 12-um thick BEOL stack with seven aluminum layers on top of a 150- 
um thick lossy substrate with a bulk resistivity of 50 Qcm. Its transmit and 
receive ports are interconnected to the corresponding differential outputs of 
the quadrature coupler with two intermediate 900-um long T-line sections 
implementing the mode conversion from a microstrip line configuration on 
the antenna side to a grounded-coplanar stripline feed on the coupler side. 
A 10 dB-defined input-impedance operation bandwidth of the standalone slot 
antenna is very broad and spans between 150 GHz and 500 GHz. The quadra- 
ture coupler driving the antenna (inset of Figure 6.41) is realized by exploiting 
both the side coupling between two adjacent strips and the broadside coupling 
between two other strips located on different metallization layers. A total 
coupling length is only 110 um. In order to ensure broadband operation of the 
hybrid coupler [Grz15], its layout is implemented by means of three buried 
metal layers of the BEOL stack to equalize propagation speeds of the relevant 
modes. For the considered radar operation bandwidth of 210-270 GHz, 
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the simulated phase and amplitude imbalance of the coupler are within 
t0.75° and 0.2 dB, respectively, whereas the isolation between the Tx and 
Rx ports is superior to —23 dB within 160-340 GHz. A radiation efficiency 
of around 62-67% within 200-300 GHz was simulated for the complete 
circularly polarized antenna, comprising the slot radiator and the coupler, 
radiating through a 150-um thick lossy Si-substrate into a silicon half-space 
[Tong94]. The parameters 3.8 x 10’ and 0.02 were assumed for metal 
conductivity and dielectric loss tangent of the BEOL stack, respectively. The 
complete packaged chip-on-lens assembly, as shown in Figure 6.41, was 
further EM-simulated in the transmit mode to study the leakage between 
the Tx and Rx ports through the antenna path in the presence of reflections 
at the lens aperture. This leakage may potentially cause nonlinear effects 
in the Rx, an increase in the noise level, or even lead to the Rx saturation 
[Brook05, Stov92, Ondr81, Pipe95]. From Figure 6.42, the simulated TX-to- 
RX leakage and return loss at the TX and RX ports are better than —21 dB and 

—23 dB, respectively. Considering the previously estimated Rx P; gp compres- 
sion point and the Tx output power, such antenna isolation is not expected to 
result in the Rx compression. Moreover, the simulated radiation efficiency of 
the entire packaged lens-integrated antenna is only a few percent lower than 


that for the corresponding silicon half-space because of the transmit mode of 
operation. 
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Figure 6.42 Simulated return loss at the TX port and the TX-to-RX leakage for the complete 
chip-on-lens packaged assembly from Figure 6.41; data from [Sta15]. 
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The complete radar module was characterized in free-space exploiting 
the Friis-transmission equation in the antenna far-field zone. The following 
key parameters were measured: radiation patterns with antenna directivity 
and axial ratio, transmitted power, and Rx CG and NF. The measurements 
were conducted for both operation modes: transmit and receive. The antenna 
directivity in the TX and the RX operation mode was measured to be 25.8- 
27.8 dBi and 25.9—27 dBi, respectively, for the radar operation frequency 
range of 210-270 GHz. An exemplary chosen radiation pattern at 270 GHz 
for the radar module in the transmit mode is shown in Figure 6.43. The pattern 
shows good beam rotational symmetry with a side-lobe level of around — 
17 dB. An axial ratio of 1-1.45 and 1—1.35 for the transmit and the receive 
mode of operation, respectively, was further measured at boresight for 210— 
290 GHz. The frequency-dependent radiated output power is presented in 
Figure 6.44, where a different power level for two module orientations (see 
Figure 6.39 for orientation definition) can be recognized. This difference is 
predominantly related to the non-ideal antenna axial ratio. A peak radiated 
power is around 5 dBm and a —10 dB-defined RF bandwidth is 46 GHz 
(217-263 GHz) [Sta15]. 
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Figure 6.43 Azimuthal view of the antenna co-polar radiation pattern at 270 GHz for the 
radar module operating in the transmit mode. 
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Figure 6.44 Frequency-dependent radiated power; data from [Sta15]. Both the total power 
and the power levels for two orthogonal antenna orientations (‘A-plane’ and ‘B-plane’) are 
plotted. For plane orientation, please, refer to Figure 6.39. 


The CG and the NF of the radar module operating in the receive mode 
were measured with the pre-calibrated reference power source from OML. It 
should be noted that this characterization was conducted in the presence of 
leakage from the Tx chain operating simultaneously, resulting in the noise 
floor increase of the receive path. The NF was calculated indirectly from the 
measured CG and the noise power spectral density at the radar baseband out- 
puts because of missing noise standards in the lab equipment at the operation 
frequency. The frequency dependence of the measured CG and NF is plotted 
in Figure 6.45. A peak CG of 12.1 dB with the corresponding minimum NF 
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Figure 6.45 Noise figure and conversion gain of the radar module for an IF frequency of 
33 MHz; data from [Sta15]. 
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of 21.1 dB was measured. A —10 dB-defined RF operation bandwidth for the 
radar receive mode is around 46.3 GHz (214.8-261.1 GHz). 

Besides the 210-270 GHz transceiver module, the complete radar system 
includes an external in-house-developed linear-frequency chirp generator, a 
set of differential IF amplifiers with a data-acquisition unit, and a MATLAB 
code for signal post-processing. Its architecture is shown in Figure 6.46. 

For maximum range resolution [Meta07], fast and wideband sawtooth up- 
chirps of high linearity from 13.1 GHz to 16.9 GHz driving the input port of 
the radar RF module are first generated. The chirps can be made as short 
as 100 us but the overall system was optimized for a chirp duration of 2 
ms. The chirp generator relies on a hybrid architecture consisting of a direct 
digital synthesizer (DDS), a phase-locked loop (PLL), and a VCO. Here, 
the chirp signal from the DDS serves as a reference for the integer-N PLL 
which then up-converts the reference DDS frequency to the required 13.1- 
16.9 GHz. The chosen chirper topology combines the benefits of both the PLL 
[Zhiy13] and the DDS in terms of fast-chirp generation and spectral purity. 
In particular, the spurious tones from DDS are suppressed by the PLL loop- 
filter. In comparison with a fractional-N PLL, the DDS-based architecture 
offers finer frequency resolution [Stel05]. In the current implementation, the 
chirper is realized as a set of off-the-shelf PCB-mounted components with 
the DDS circuit clocked at 1 GHz from the signal generator Agilent E8257D. 
In the PLL, a high tuning range (13—20 GHz) VCO from Sivers IMA is used. 
In order to minimize the overall PLL phase noise, a comparison frequency of 
the phase-frequency detector was selected to be as high as possible, resulting 
in the PLL loop division ratio, N, of 192 (4 x 48). A third-order active filter 
with a 1.8 MHz low-noise operational amplifier implements the PLL loop 
filter. With the aid of a behavioral simulation model, the PLL loop bandwidth 
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Figure 6.46 Architecture of the complete radar under test with a metallic plate located at a 
distance R; after [Sta15]. 
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was set to 479 kHz with an appropriately high phase margin of the open-loop 
transfer function (around 58°) for minimum total integrated phase noise of 
the chirp generator. 

In Figure 6.47, an exemplary chosen phase noise at the frequency chirper 
output around 15 GHz is presented. The total jitter integrated from 10 Hz 
to 100 MHz is 2.62° rms. The linearity of the generated frequency ramp was 
verified by means of Hilbert transform [Thro84, Grei93] but only indirectly at 
the divider output due to limitations in the lab equipment. With this approach, 
a complex-value analytic signal is first obtained from the real-value time- 
domain train acquired at the radar IF output. The instantaneous phase of such 
an analytic signal is then compared with the ideal phase trajectory of a linear 
frequency chirp and the phase deviation between the two can be computed. 
The corresponding frequency error results from the rate of change of this 
phase deviation and its root-mean square value is defined over the chirp 
duration time. In particular, for a frequency ramp of 210-270 GHz swept 
over 2 ms, the rms frequency error is below 9 kHz. 

In the current radar implementation, beat signals at the IF ports are digi- 
tized by an external 16 bit sampling card from National Instruments and then 
post-processed using MATLAB routines. The card sampling rate is limited 
to 2 MHz which results in 4,000 samples for 2 ms long chirp period. For the 
consecutive experiments, a Hanning window will be predominantly applied 
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Figure 6.47 Phase noise of the frequency chirp generator at 15 GHz driven from the 
frequency synthesizer Agilent E8257D at 1 GHz; after [Stal5]. A frequency of 15 GHz 
corresponds to 240 GHz for the up-converted signal at the radar RF output. 
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in the FFT data post-processing as a fair compromise between selectivity 
and resolution [Harr78]. For a 2 ms long frequency chirp, this window 
corresponds to an equivalent noise bandwidth of 750 Hz. 

In order to minimize the influence of the RF front-end non-idealities on 
the performance of the complete radar system, a 2-step calibration procedure 
was conducted with the aid of a metallic plate as a single-target reflector at 
a distance of 80 cm from the radar module, as shown in Figure 6.46. Such a 
target shows the 1/A? frequency-dependent radar cross section, where A is the 
free-space wavelength. The first step of the calibration aims at removing the 
influence of the close-in returns resulting from the limited isolation between 
Tx and Rx paths which do not depend on the imaged objects and appear as 
low-frequency IF signals at the Rx output port. This step does not require any 
reference target to be placed in front of the radar antenna. However, to mimic 
this ‘no-target’ radar response in the presence of close-proximity reflections 
in the lab environment coming from insufficient absorber attenuation, the 
metal plate was tilted by 45° to the boresight of the radar antenna. In the next 
step, the metallic plate was set back to its perpendicular position with respect 
to the radar boresight and the Hilbert transform on the acquired calibration 
signal from the plate was applied to calibrate the influence of parasitic 
phase and amplitude modulations resulting from the non-ideal RF front-end 
characteristics. A frequency dependence of the normalized power received 
from the plate after the Hilbert-transformed time-domain IF calibration train 
is plotted in Figure 6.48. Considering the 1/\? frequency-dependent radar 
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Figure 6.48 Normalized frequency-dependent power received from the calibrating metallic 
plate after the Hilbert-transformed time-domain IF calibration train; data from [Sta15]. 
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cross section of the metallic-plate, a —10 dB-defined bandwidth of around 
45 GHz can be estimated for the complete radar transceiver combining the 
characteristics of both Tx and Rx paths. 

For verification purposes of the applied calibration procedure, the beat- 
signal time-domain trains were further acquired for different positions 
of the metal plate. Exemplary, the calibrated frequency-dependent radar 
response to the plate at a distance of 60 cm is presented in Figure 6.49. 
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Figure 6.49 Radar response to a metallic plate located at a distance of 60 cm from the 
radar module; data from [Stal5]. (a) Magnitude response after amplitude calibration only, 
(b) instantaneous beat frequency after the amplitude and phase corrections. The beat frequency 
de-embedded from the peak in the IF power spectrum of the return signal is 124.1 kHz (see 
also Figure 6.50). Both frequency and time units are shown due to duality of the sweep time 
and the actual RF frequency for a linear FMCW radar. 
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From the amplitude-corrected magnitude response, it can be noticed that 
the envelope of the acquired beat signal is almost constant in the frequency 
range of around 60 GHz but it starts deviating below 220 GHz and beyond 
260 GHz. It was verified that two parasitic harmonics with the x14 and 
x18 multiplication factor leaking from the multiplier chain-based LO path 
are mainly responsible for this behavior which could not be appropriately 
calibrated. Similarly, the extracted instantaneous beat frequency after ampli- 
tude and phase correction steps is influenced by the same harmonic spurs. 
These harmonic distortions and not the Rx noise floor are primarily limiting 
the achievable spurious-free dynamic range (SFDR) and the operational 
bandwidth of the currently implemented radar. 

The corresponding FFI-computed IF power spectra of the calibrated 
beat signal for two RF bandwidths of 60 GHz and 45 GHz are shown in 
Figure 6.50. The Hanning window was applied in the computation for low 
spectral leakage and large dynamic range (DR) [Harr78]. Here, similar to the 
plots from Figure 6.49, the influence of x14 and x18 harmonic spurs at 109 
kHz and 140 kHz, respectively, located around the main peak at 124.1 kHz 
can be recognized. Please note that for a reduced bandwidth of 45 GHz, the 
radar SFDR achieves around —40 dBc. 

From Figure 6.50, the achievable radar range resolution can be further 
extracted by means of the so-called point spread function (PSF). For a 
60 GHz operation bandwidth, a theoretical range resolution of c/2B = 
2.5 mm can be calculated for the currently implemented radar, where B is the 
RF bandwidth and c is the speed of light. This theoretical resolution is, how- 
ever, of limited practical use. In practice, the ability of distinguishing between 
two close-proximity targets is more relevant. In this case, the main-lobe 
full-width at —6 dB of the PSF becomes the parameter of interest. For a rect- 
angular weighting function promising the best resolution but with the highest 
side-lobe level of —13 dB, it results in 3 mm. With the Hanning window, 
commonly applied in imaging for low spectral leakage, this number becomes 
2c/2B = 5.0 mm [Harr78]. A main-lobe full-width at —6 dB of 5.14 mm 
was extracted for the radar implemented here operating with the maximum 
considered bandwidth of 60 GHz after amplitude and phase corrections and 
the Hanning window applied, which is close to a theoretical limit of 2.5 mm. 

A very simple 2-D scanning optical setup comprising two elliptical col- 
limating and refocusing mirrors, as shown in Figure 6.51, was applied to 
demonstrate the radar 3-D imaging capabilities. Here, the scanned objects 
were placed in the focal point of one of the mirrors. A lateral optical 
resolution of around 1 mm was estimated for this setup with the aid of a 
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Figure 6.50 The FFT-computed IF spectrum of the calibrated beat signal corresponding to 
the metallic plate spaced by 60 cm from the radar module for two different operational RF 
bandwidths: (a) 60 GHz and (b) 45 GHz. For comparison purposes, the chirp duration was 
varied for both bandwidths to arrive at the same beat frequency of 124.1 kHz. For 60 GHz, it 
was set to 2 ms whereas for 45 GHz it was reduced accordingly. The influence of harmonic 
spurs can be identified around 109 kHz and 140 kHz. 


simple aluminum pin-type heatsink as a resolution target. For a set of the con- 
secutive imaging experiments, the previously mentioned Hanning weighting 
function was replaced by the Hamming window offering a slightly improved 
resolution of 4.65 mm [Harr78] for the full 60 GHz operation bandwidth. 

As a scanning object, a 12 cm x 6 cm large cardboard box with a hidden 
blister pack of drugs and two missing tablets was chosen. The object was 
meander-scanned in both directions (X and Y), as sketched in Figure 6.52, 
and the acquired data was post-processed using 3-D data matrix routines from 
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Figure 6.51 2-D optical scanning test setup for demonstration of the radar 3-D imaging 
capabilities. A total path length between the radar module and the object is around 780 mm. 


MATLAB. Each IF signal burst took 2 ms and was sampled at two MSPS. The 
range profile was sampled with a resolution of 0.5 mm, whereas the pixel 
lateral pitch was set to 0.25 mm after interpolation, resulting in an image size 
of 200 x 480 x 240 voxels for a range-gated distance of +50 mm around the 
image center position. 

Figure 6.52 presents the exemplary chosen 2-D image of the normalized 
power received for an object-to-radar distance of 780 mm altogether with the 
range profiles for two lateral positions across the cardboard which correspond 
to the present and the missing tablet. It can be noticed that the signal returns 
correlating with the positions of the lidding seal of aluminum foil, the plastic 
cavity, and the cardboard box can be identified due to the radar appropriate 
range resolution and finally the missing tablets can be detected. The corre- 
sponding 3-D surface reconstruction of the scanned object can be found in 
Figure 6.53. Here, the image is formed with a peak-search algorithm which 
identifies the positions of the highest reflected power for each lateral position 
and the color-scale represents the normalized received power. 


6.2.3 0.5 THz Computed Tomography 


Increasing the transistors fmax deep into the terahertz frequency range does 
not only lead to a significant performance improvement for mm-wave and 
sub-mm-wave circuits, it also contributes to the vision of closing the THz 
gap with silicon-based circuits. Traditional compound semiconductor-based 
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Figure 6.52 3-D imaging experiment with the implemented radar module. (a) Cardboard 
box with a blister pack of drugs with two missing tablets as the scanned object. (b, c) 2-D 
scan of the normalized power received for an object-to-radar distance of 780 mm altogether 
with the range profiles for two different X—Y positions across the cardboard. The positions 
correspond to the present and the missing tablet, respectively. Both acquired range profiles 
show a DR of around 50 dB. 
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Figure 6.53 3-D surface reconstruction of the object from Figure 6.52(a) after a peak-search 
algorithm. The scan was appropriately range-gated (770 mm < Z < 788 mm) to eliminate 
the influence of multi-path reflections inside the cardboard box and the plastic cavities of the 
blister pack as well as reflections from the front of the box. 


THz imaging systems tend to be bulky and expensive and thus suffer from 
a poor price—performance ratio. The advances in SiGe-HBT and CMOS 
technology continuously increase the device power generation and detection 
capabilities in the THz frequency range and thus may ultimately leverage the 
commercial interest in THz imaging systems. Three-dimensional THz imag- 
ing based on the principle of computed tomography (THz-CT) is one of the 
applications that may potentially be explored in commercial environments. 
THz-CT offers volumetric object reconstruction with an image contrast based 
on the characteristic THz absorption of the illuminated material. Since THz 
radiation is non-ionizing and thus requires no dedicated safety measures, 
THz-CT represents an interesting alternative to X-ray technology for low-cost 
industrial quality control. 

The THz-CT system implemented in this work is solely based on com- 
ponents built in silicon technology that was developed in the frame of 
DOTSEVEN. Figure 6.54 shows an illustration of the THz-CT scanner. The 
radiation of a SiGe-HBT source is focused in the object plane and refocused 
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Figure 6.54 Illustration of the THz-CT scanner. The system comprises a 490 GHz SiGe- 
HBT source, an NMOS detector, and an optical train based on four f# = 2, 50 mm PTFE- 
lenses. The object is rotated (p) and stepped in the 2D object plane (y,z). 
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Figure 6.55 Photograph of the THz-CT scanner. 


to an NMOS detector with low-cost PTFE lenses. The transmission through 
the object is measured along the y-axis and for different projection angles to 
form the sinograms of the object. This pocess is repeated along the z-axis to 
allow full 3D reconstruction based on a filtered back-projection algorithm. 
In order to facilitate measurements at multiple projection angles and posi- 
tions, the location of the object at is computer-controlled by x x y x ọ 
stepper motors. For each position the detector output signal is sampled with 
a data-acquisition system. 


6.2.3.1 Components 

There are two components that define the quality of THz-CT systems. 
Increasing the DR, which is defined as the relation between maximum 
received signal without an object and the integrated noise over the readout 
bandwidth, relaxes the trade-off between object thickness, material com- 
position, and scanning time. Secondly, the achievable image resolution is 
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inversely proportional to the beam spot size in the imaging plane, which 
is defined by the wavelength and effective aperture and focal length of 
the optics. Since the output power of silicon-based radiation sources drops 
significantly when going beyond fmax, the trade-off between achievable 
DR and operational frequency becomes he bottleneck for silicon-based THz- 
CT systems. This fact stresses the need for a high-quality source design 
that maximizes the radiated power while providing sufficient directivity and 
Gaussicity. 


A. Source design 

The source in this work was implemented in Infineon DOTSEVEN 0.13 ym 
SiGe BiCMOS technology with an ft/fmax of 260/350 GHz [Hill15]. It 
comprises a broadband lens-integrated circular slot antenna coupled to a 
single-ended triple-push Colpitts oscillator. Figure 6.56 shows the schematic 
and the microgrpah of the source. An operation frequency of 490 GHz was 
chosen as a compromise between resolution and power generation capability 
of the technology. However, the output frequency is still significantly higher 
than the fmax of the technology, necessitating the use of harmonic generation 
techniques. In this design, a triple-push topology is used to extract the third 
harmonic at the base terminal of three Colpitts oscillators. The circular 
slot antenna connected to the common base node loads the circuit at the 
fundamental oscillation frequency and thus forces the oscillators to run 120° 
out of phase [Tang01]. In this mode, the third harmonic currents of all three 
oscillators add in phase, while the currents at the fundamental frequency 
superimpose destructively. Note that the symmetry of the physical design is 
very important since it directly impacts the extraction efficiency of the third 
harmonic and the rejection of the fundamental. 


a) b) 
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Figure 6.56 Schematic (a) and micrograph (b) of the 490 GHz SiGe-HBT radiator, after 
[Hill15]. 
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The design procedure for the Colpitts oscillator can be summarized as 
follows. First, small-signal simulations were used to optimize the transistor 
size and the feedback capacitor Ce by maximizing the negative resistance at 
the design frequency of 163 GHz. After that, the tank inductance TL}, was 
sized to tune out the imaginary part of the transistor input impedance. The 
series—series feedback introduced by Ce increases the reverse transmission 
behavior of the circuit and thus the impact of the collector load impedance 
on the third harmonic matching at the base terminal [Pfeil4]. In this design, 
the maximum third harmonic output power is realized by avoiding the lossy 
collector via stack and by using an ideal short circuit at the collector termi- 
nal. The broadband harmonic idler is realized with three capacitors (Com, 
Cmimi, Cmim2) that are self-resonant at the first, second, and third harmonic. 

The lossy silicon die and the strickt design rules that are usually enforced 
upon modern silicon technologies make silicon chips a very unfavourable 
environment for integrated antennas. At the same time, the requirements 
for the antenna system in a THz-CT system with a free-running source are 
high. Process variations and modelling inaccuracies can lead to a shift in the 
oscillation frequency after manufacturing which makes a broadband antenna 
design inevitable if a fist-pass design is needed. Additionally, the antenna 
needs to have a directivity of around 20 dBi to be compliant with low-cost 
optical components, i.e., 5 mm-diameter PTFE-lenses. These requirements 
call for a broadband lens-integrated antenna system that is composed of an 
on-chip primary antenna and a secondary hyper-hemispherical silicon lens. 
In this design a multi-layer linearly polarized circular slot antenna was used 
to illuminate a 4 mm-diameter silicon lens. Figure 6.57 pictures the HFSS 
model used for 3-D EM simulation of the antenna system. The simulation 
results show a directivity of 22.5 dBi and 86% radiation efficiency. 

The source module was fully characterized in free space. The radiation 
frequency of the source was measured with an 18th harmonic mixer. For the 
biasing conditions that optimize the radiated power for all supply voltages, 
the oscillation frequency is close to constant at 490 GHz. The radiated power 
was measured with a photo-acoustic power meter (TK). Figure 6.58 shows 
that the source delivers an output power of up to 38 uW with a DC-to-RF of 
up to 0.059%. 


6.2.3.2 Detector design 

The terahertz detector is a zero-bias NMOS detector fabricated in IHP’s 0.13- 
um SiGe-BiCMOS technology. The asymmetric NMOS device is derived 
from the standard 1.2 V NMOS by a modified layout of the source/drain 
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Figure 6.57 A 3-dimensional EM simulation model of the packaged source module with a 
silicon chip mounted on the back of a 4 mm hyper-hemispherical silicon lens. 
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Figure 6.58 Measured output power and DC-to-RF efficiency versus frequency. 


extension masks [Jain16]. The drain side comprises the normal high-dose 
n+ extension (HDD) and halo implants while the source side is implanted 
with the low-dose n-extension (LDD) from the 3.3 V I/O devices. Due to the 
absence of halo implants at the source side, a reduced threshold voltage of 
the transistor shifts the optimal gate bias point for highest sensitivity to zero 
volts. Similar to the source design, the detector antenna is comprised of a 
primary on-chip antenna and a secondary silicon-lens. Figure 6.59 shows the 
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Figure 6.59 Measured voltage responsivity and NEP for different gate bias voltages and 
micrograph of the detector. The modification of the source/drain extension masks shifts the 
bias point for optimum sensitivity to zero volts, after [Jain16]. 


measured voltage responsivity and noise-equivalent power (NEP) of the zero- 
bias NMOS detector for different gate bias voltages. The responsivity peaks 
at zero bias with 450 V/W and the detector shows an NEP of 80 pW/v Hz. 


6.2.3.3 THz-CT results 

The THz-CT system can be operated in two modes. A rapid acquisition mode 
with CW illumination with continous object rotation and a rotary encoder- 
based angle allocation can be used for rapid data acquisition. Furthermore, a 
acquisition mode with a chopped source and stepped object rotation can be 
used when high accuracy and DR are needed. Figure 6.60 shows the measured 
DR for different source chopping frequencies for 1 ms lock-in time constant. 
The system offers 38 dB DR in CW mode and around 60 dB for chopping 
frequencies higher than 1 kHz. The spot size and related image resolution 
was measured using the knife edge method [Gonz13]. A knife was mounted 
to a high-precision translation stage and was moved into to spot to block off 
a 2D plane from further propagation through the optical train. Figure 6.61 
shows the normalized measured power received by the detector for y- and 
z-axis knife translation in the focus point and the corresponding result of the 
Gaussian fit obtained with 


Pmax ct xo) 


P= [1 + erf(v2 l 
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Figure 6.60 Dynamic range of the THz-CT system for different chopping frequencies 
measured with a lock-in amplifier with 1 ms time constant. 
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Figure 6.61 Measured and fitted normalized power at the detector for a knife translation in 
y- and z-directions. The Gaussian beam waists are 2.54 mm in y-direction and 2.40 mm in 
z-direction. 
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Figure 6.62 Tomographic reconstruction of a Y-shaped hook driver inside a polyethylene 
container. The image was recorded with 1 mm spatial and 9° angular resolution within a 
250 min acquisition time. 
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where Pmax is the maximum received power, x — xg is the relative position 
from the beam center, and w is the 1/e? beam radius [Gonz13]. The resulting 
Gaussian beam waists in y- and z-directions are 2.54 mm and 2.40 mm. The 
values closely reassemble the theoretically estimated value of 2.26 mm for 
an effective lens aperture of 30 mm. Figure 6.62 shows the result of the 
tomographic reconstruction of a Y-shaped hook driver that is hidden inside 
a polyethylene container with a size of 26 mm x 48 mm. The image was 
recorded in stepped acquisition mode with 1 kHz chopping and a spatial 
and angular resolution of 1 mm and 9°, respectively. The total scanning 
time in this scanning time is still quite high with 250 min. However, with 
a continuous object rotation and CW detection, the overall acquisition time 
can be reduced to as low as 20 min. 
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7.1 Introduction 


The results of DOTSEVEN described in the previous chapters of this book 
mark a milestone in the development of SiGe HBT technology. This chapter 
reflects on how this milestone fits into the overall picture of semiconductor 
technologies with potential for high-speed/high-frequency applications. Fur- 
thermore, as any milestone is a temporary state, the possible future prospects 
of SiGe HBT device performance will be presented in terms of a roadmap. 
Finally, obstacles on the path toward the perceived performance limits of this 
technology are discussed. 


7.2 Technology Comparison 


Circuits operating at mm-wave frequencies have traditionally been imple- 
mented in I-V semiconductors due to the higher mobility in these materials. 
However, the high mobility occurs only at relatively low electric fields, 
which are easily exceeded under circuit-relevant bias conditions, especially 
when trying to generate high output power. Nevertheless, the fastest HBTs 
today have been fabricated in InP technology with the respective proto- 
typing processes reaching (fT, fmax) values around (0.5, 1.1) THz for 
emitter widths of 130 nm [Urte11] and 200 nm [B0116]. Compared to these 
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Figure 7.1 Operating speed comparison between SiGeC HBTs, InP HBTs, and MOSFETs 
vs. critical lithography dimensions (i.e., emitter width or channel length): (a) maximum 
oscillation frequency and (b) transit frequency. The lines represent LSQ fits of the data, the 
red filled squares DOTSEVEN results, and the larger crosses the best InP HBT data. 


devices, DOTSEVEN SiGe HBTs with (fT, fmax) = (0.5,0.72) THz are 
about one generation behind in terms of power gain cutoff frequency. 
This performance difference is displayed in Figure 7.1, which includes fp 
and fmax data of the three mainstream technology contenders gathered from 
many publications [Ros16]. The DOTSEVEN results have been marked by 
the red squares. 

Although InP HBTs do have certain advantages over SiGeC HBTs such as 
higher breakdown voltage (at the same speed) and the potential of combined 
optical/electronic operation, it was shown in [Voi04] that “at comparable fp 
and fmax, there is very little difference in their performance in narrow-band 
mm_-wave and in broadband and high-speed digital circuits”; i.e., for circuit 
applications, the advantage of the higher breakdown voltage in InP HBTs 
appears to be relatively small. 

A main disadvantage of III-V technologies is their fairly low integration 
level and yield as well as the difficulty of structural downscaling due to 
strong surface recombination and the resulting low current gain. Hence, it 
will be difficult to leverage these technologies to their full extent to enable 
both more complex mm-wave systems and, in particular, mass-market wafer 
volume in the future. As a consequence, I-V material-based technologies do 
not achieve functionality and energy efficiency increases comparable to those 
of silicon technology, which in turn makes it difficult for them to compete 
also on cost. In addition, the relatively large process variations and the 
lack of accurate modeling tools have notoriously hampered (cost-) efficient 
IV circuit design. Therefore, the I-V semiconductor industry and the 
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related investment have been focused strongly on high-frequency/high-speed 
low-volume niche applications. The market success of more recent 
approaches toward a heterogeneous integration with completed CMOS 
wafers (e.g., [Ram12]) remains unclear yet. Among the issues are certainly 
the very different process qualification criteria in the silicon (especially the 
digital CMOS) world and in the III—V world. 

According to Figure 7.1 the cutoff frequencies of the best RF-CMOS 
processes come close to those of SiGe HBTs, but at the expense of a 
significantly more advanced lithography, typically at least three lithography 
nodes. Achieving the DOTSEVEN results though would require a CMOS 
process with about a 14 nm channel length (which does not correspond to the 
14 nm node!), assuming that progress in device speed continues for CMOS as 
sketched by the corresponding dashed line in Figure 7.1. As will be discussed 
later below, this assumption is unlikely to be the case. 

The considerations so far have centered around device-related operating 
frequencies. However, in a circuit the transistors are connected to other 
devices, which along with the connection represent more or less large capaci- 
tive, inductive, and resistive loads. For instance, high-quality passive devices 
for RF applications have to be placed in the uppermost metalization levels. 
Figure 7.2(a) displays a 3D view of a typical connection between a transistor 
and the upper metal layer. The impact of this connection on transistors having 
the same speed after deembedding (about 300 GHz) is shown in Figure 7.2(b): 
the MOSFET’s fr decreases by about a factor of two, while the HBT looses 


f} (GHz) 


omeo a) (mum) 


(a) (b) 


Figure 7.2 Impact of device connections to other circuit elements (a) on the transit frequency 
of a SiGeC HBT with 120 nm emitter window width and a MOSFET of the 28 nm node (b). 
The upper lines in (b) represent the pad and pad-device connection line deembedded data, 
while the lower lines represent the un-deembedded data. 
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about 20% of its speed. Similar observations have been noted in [Inal1], 
where the peak operating frequencies (fT, fmax) of 45 nm MOSFETs drop 
by a factor of two, once the metallization necessary for building circuits is 
included. The reason why HBTs do not show this severe deterioration is 
their much higher transconductance gm and corresponding drive capability. 
In other words, devices with the same ratio of g,, and input capacitance have 
the same fr, but devices with a higher gm will fundamentally fare better in 
circuits since the device capacitance there will only be a more or less small 
fraction of the total capacitance. 

Based on the observations described above, it is therefore instructive to 
look at the values of gm when comparing process technology performance, 
since gm represents a better indication of achievable circuit speed. Figure 7.3 
shows the measured and predicted values of gm for a variety of incumbent 
and emerging process technologies. For comparison, gy, has been normalized 
to emitter length (for HBTs), gate width (for planar MOSFETs), or channel 
perimeter length (for nano-wire and -tube FETs!). It is clearly visible that 
HBTs have a much higher transconductance per finger length than FETs. 
Furthermore, FETs appear to be unable to ever catch up to HBTs in terms of 
transconductance, even when assuming ideal performance scaling according 
to the approximation of existing data (dashed lines). 

Figure 7.3 also includes the transconductance values for future technol- 
ogy nodes, which have been predicted based on detailed TCAD simulations 
and compact models for complete 3D transistors with all relevant parasitics 
included [Sch17, Voil7] and have become the basis for the 2014 and 2015 
ITRS tables. For MOSFETs, the peaks of gm (around 4 mS/um) and fr 
(around 600 GHz) are predicted for a channel length of 10 nm, while beyond 
that length the strong impact of surface scattering in the extremely thin 
channel layer will lead to a drastic decrease (to, e.g., 2m ~ 1.5 mS/um, fT 
= 200 GHz). This is in stark contrast to HBT scaling, which significantly 
benefits from smaller dimensions in all aspects of electrical performance 
until a critical base width is reached and the small emitter area leads to a 
high emitter contact resistance.” The ultimate performance of SiGe HBTs 
was investigated in [Schlla, Schl 1b] using detailed TCAD simulation and 


‘Normalizing to the perimeter takes into account (roughly) that screening effects would 
lead to a lower gm value if the wires or tubes were placed next to each other, which would 
correspond to a normalization to the diameter (or minimum footprint for a gate-all-around 
channel). 

Note though that the impact of contact resistances increases with smaller device dimen- 
sions in all technologies and becomes visible first in highly scaled FETs. 
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Figure 7.3 Comparison of the terminal (or extrinsic) transconductance of transistors from 
a large variety of process technologies. Filled symbols represent measured data and open 
symbols represent predicted (roadmap) data. For FETs, the legend designations correspond to 
the channel material and structure: planar I-V such as InGaAs (III-V bulk); planar single- 
or few atomic layers such as black phosphorus or germanane (2D); planar transition metal 
dichalcogenide such as MoSə (TMD); planar or FinFET silicon (Si-CMOS); nano-wire silicon 
or III-V (NW); carbon nano-tube (CNT). 


accurate compact models with all known physical effects and device-related 
parasitics included. The predictions resulted in a transit frequency between 
0.8 and 1 THz and maximum oscillation frequency around 2 THz, depending 
on the assumed contact resistivities especially for the emitter. For the ITRS 
tables (see also [Sch17] for more details), quite conservative values have been 
assumed, which lead to somewhat reduced device operating frequencies and 
transconductances compared to [Sch1 1b]. 

The doping profile of the technology node N3, the performance of which 
was predicted in 2013, served as the guideline for the DOTSEVEN process 
development. During process development, accurate physics-based model 
parameters were determined based on which the evaluation of the impact 
of the various physical and parasitic effects on device performance [Kor15, 
Paw17] is possible. This strategy provided valuable insight for prioritizing 
the process development tasks. It is interesting to note that the electrical 
parameters, such as base sheet resistance and capacitance per area, of the 
final process version of DOTSEVEN meet those of the roadmap node N3 
predicted earlier quite well. This validates the accuracy of the predictions and 
the employed approach. 
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The predicted progress in device speed is closely linked to the increase 
in current densities required for achieving peak operating frequencies. The 
corresponding transconductance at the device terminals is more or less 
strongly reduced by the emitter series (contact) resistance. With conservative 
assumptions about contact resistivities, transconductance values of at least 
40 mS/um can be expected according to Figure 7.3 at the present end of 
the roadmap. This value remains an order of magnitude higher than that 
predicted for the best MOSFETs and also still higher than that predicted 
for the best CNTFETs by a factor of four. Notice that due to the lack of 
sufficient hardware, the predictions of the latter are associated with much 
higher uncertainty than those for MOSFETs. 

An important aspect for mm-wave and THz applications is the achievable 
output power at a given frequency. For advanced MOSFET technologies, the 
latter is impacted significantly by the decrease of the voltage gain, which 
is caused by the increased output leakage and associated output conduc- 
tance. HBTs do not show this decrease unless they are scaled (vertically) 
beyond the last node presently shown in the ITRS tables. Output power can 
also be increased with device size. While this increases the device-related 
capacitances in both HBTs and MOSFETs, an increase in emitter length 
alone reduces all series resistances in HBTs but a corresponding gate width 
increase in FETs leads to a larger gate resistance. Moreover, the maximum 
allowed drain—source voltage in advanced RF-MOSFETs is lower than the 
maximum collector—-emitter voltage in advanced high-speed HBTs. Increas- 
ing the number of parallel devices in MOSFET-based power amplifiers leads 
to larger interconnect parasitics per unit drain width, while stacked power 
amplifier architectures require a larger number of devices and thus again more 
passives and parasitics per unit drain width. Another important building block 
in (sub-)mm-wave systems is the VCO. Its phase noise strongly depends 
on the flicker (1/f) noise of the transistors. Here, HBTs have much lower 
comer frequencies than MOSFETs, in which 1/f noise keeps increasing for 
more advanced nodes. Overall, HBTs appear to have a distinctive advantage 
in the area of mm-wave and THz amplifiers from a technical circuit design 
perspective. 

Finally, a few words on cost. With the development and added masks for 
the required RF-passives, an RF-CMOS process becomes significantly more 
costly than the corresponding digital process. Thus, a depreciated CMOS 
process with high-speed SiGe HBTs integrated (i.e., a BiCMOS process) 
can often be cheaper and thus very cost competitive to advanced RF-CMOS. 
Such a BiCMOS process not only provides better RF front-end performance 
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(in terms of analog HF features and energy efficiency) but is also earlier 
available on the market. In view of these aspects, RF-CMOS based on an 
advanced node makes sense only for applications that require (i) higher digital 
functionality and density than an already available BiCMOS process with 
comparable front-end (i.e., HBT) performance and (ii) very high product 
volume. 

Due to the already existing large investment and associated infrastructure 
in 200 and 300 mm silicon wafer fabs around the world, SiGe:C HBTs with 
operating speed in the THz range are desirable. Implemented into depreciated 
(and hence low-cost) digital CMOS platforms, the associated SiGe:C BiC- 
MOS single-chip technologies are capable of covering (at reasonable wafer 
cost) medium- and large-volume applications with mm-wave and THz analog 
front-ends, which would be either far too expensive when implemented in 
most advanced CMOS technology or would not even be possible to realize 
there due to the inferior analog characteristics of future advanced MOS- 
FETs as discussed above. Therefore, a relatively small investment in SiGe:C 
HBT process development will yield large gains in terms of (ultimately 
commodity) market coverage. 
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The terahertz or sub-millimeter frequency range, roughly defined as extend- 
ing from 300 GHz to 3 THz, has so far resisted attempts to broadly harness 
its potential for everyday applications. This led to the expression THz gap, 
loosely describing the lack of adequate technologies to effectively bridge this 
transition region between microwaves and optics — both readily accessible via 
well-developed electronic and laser-based approaches — by, e.g., integrated 
and cost-efficient electronics. THz technology is an emerging field which has 
demonstrated a wide-ranging potential. Extensive research in the last years 
has identified many attractive application areas and paved the technologi- 
cal path toward broadly usable THz systems. THz technology is currently 
in a pivotal phase and will soon be able to radically expand our analytic 
capabilities via its intrinsic benefits. 

Applications of mm-wave and THz frequencies led by the silicon inte- 
grated technologies can be subdivided into communication, radar, imaging, 
and sensing areas, as illustrated in Figure 7.4. In the following sections, we 
describe some of these applications and the recent, state-of-the-art hardware 
developments in the corresponding areas as it relates to DOTSEVEN. 
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Figure 7.4 Potential applications for silicon integrated mm-wave and THz circuits. 


7.3.1 Communication 


To improve the data capacity beyond 10 Gbps for wireless transmission, 
improving the spectral efficiency with advanced modulation schemes is no 
longer sufficient and higher bandwidth becomes an absolutely necessity. The 
mm-wave and THz bands, due to their large relative bandwidth, show great 
potential for future wireless communication [Son11]. Such large frequen- 
cies, however, suffer from greater atmospheric attenuation as compared to 
the traditional radio waves. Still, the atmospheric windows near 90 GHz, 
140 GHz, and 240 GHz are being considered for future communication and 
radar applications [Fed10]. 
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Due to a larger atmospheric attenuation, the THz waves are expected 
to be first used for indoor wireless personal-area (WPAN) and local-area 
(WLAN) networks, where the range is limited to a few tens of meters at the 
most. Along with the high-capacity, the high directivity of THz waves is also 
considered as an advantage to ensure secure (requiring line-of-sight) or high- 
density, non-interfering data networks. Some of the possible applications of 
THz communication are: 


1. Wireless distribution of HDTV content in an in-home network. While 
the current 60 fps, progressive full HD (1080p60) content requires data 
transmission rates near 3 Gbps, future ultra HD 8K content would 
require wireless data transmission rates in the ballpark of 24 Gbps 
[Son11]. 

2. High-bandwidth wireless backhauls in high-density mesh multipoint to 
point/multipoint (first-mile and last-mile) networks may require data 
rates reaching 100 Gbps. 

3. Rapid uploading and downloading of large files from a server which can 
serve as a public data kiosk. 

4. Inter- and intra-building communication networks for manufacturing 
floor automation, etc. 


Silicon—germanium HBT technology is driving the front-end development 
for mm-wave and THz bands, and several key components as well as fully 
integrated transceiver systems have been demonstrated. Figure 7.5 shows 
the output power versus frequency for some recently published SiGe mm- 
wave/THz sources. A 0.53 THz, 0 dBm free-space radiating source composed 
of 16 on-chip, non-locked radiating pixels, each with a differential triple- 
push oscillator (TPO) and on-chip ring antenna, was demonstrated in [Pfe14]. 
A similar, single-ended TPO design for a single 0.49 THz radiator was 
also shown in [Hil15]. [Hil17] demonstrated another approach of using a 
fundamental differential Colpitts oscillator with second harmonic extraction 
at 215 GHz and feeding it to a frequency doubler for 430 GHz signal radiation 
with —6.3 dBm of output power. 

Cascaded multiplier chains with larger multiplication factors have also 
been demonstrated. In this approach, an external, phase-stable signal is fed to 
the multiplier chain to generate the THz signal. In [Eril1], the x 18 multiplier 
chain consists of two cascaded tripler stages followed by a balanced doubler 
and shows a peak output power of —3 dBm at 325 GHz. 

Further signal amplification is done by on-chip integrated power ampli- 
fiers (PAs). In [Nee13], a 160 GHz PA with 20 dB gain and 10 dBm 
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Figure 7.5 Frequency versus output power of some recently published SiGe integrated mm- 
wave/THz sources. 


saturated output power (Psat) was demonstrated. A parallel power- 
combination approach with four PA cores was used in [Nee16] to show 25 dB 
gain and 9 dBm Psat for 200-225 GHz. 

In the recent years, fully integrated silicon RF front-ends operating above 
200 GHz have become feasible due to the continuous technology improve- 
ment. Both CMOS [Kan15, Thy15, Par12, Tak17] and SiGe [Fril7, Rod17, 
Rod18] front-ends have been reported in the literature. Circuits using sub- 
harmonic techniques with doubler or tripler as last stage before the antenna 
like a 240 GHz transceiver chipset utilizing QPSK in 65 nm CMOS can 
be found in the combination of [Kan15] and [Thy15]. A 260 GHz OOK 
transceiver in 65 nm bulk CMOS was reported in [Par12]. [Tak17] presents a 
CMOS transmitter working at 300 GHz capable of communicating over 5 cm 
at 56 Gbps. A complete SiGe chipset for 50 Gbps at mm distance is presented 
by [Fril7]. In [Rod17], a SiGe integrated 240 GHz transceiver chipset in 
the DOTSEVEN technology (as detailed in Chapter 6) is reported with a 
data-rate of 50 Gbps communicating over 100 cm distance. The data rate 
was further improved to 65 Gbps by post-process IF filtering at the receiver- 
end in [Rod18]. A comparison of these front-ends is presented in Table 7.1. 
Extensive research in device technology and circuits continues to improve the 
performance further, indicating that SiGe HBTs are becoming a formidable 
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Table 7.1 Si-integrated wireless communication links above 200 GHz 


Frequency Data Error 
Reference [GHz] Tx/Rx [Gbps] (Modulation) Distance 
[Kan15] 240 Tx 16 = - 
[Thy 15] 240 Rx 10/16 BER < 10° ° — 
7/1074 
(QPSK) 
[Par12] 260 Tx, Rx 14 — (OOK) 4 cm 
[Fri17] 190 Tx, Rx 50 BER < 107? 0.6 cm 
(BPSK) 
[Rod17] 240 Tx, Rx 50 EVM 100 cm 
29% (QPSK) 
[Tak17] 300 Tx 56 EVM 13.4% 5 cm 
(16 QAM) 
[Rod18] 240 Tx, Rx 65 BER 100 cm 


<10~* (QPSK) 


alternative to III-V technologies for mm-wave and THz communication, and 
SiGe HBT technology is quickly extending toward 100 Gbps data-rates. 


7.3.2 Radar 


Radar systems are used for distance and velocity sensing, and they also bene- 
fit by moving to higher frequencies. A larger available absolute bandwidth 
at higher frequencies improves the overall range resolution. Also, higher 
frequencies allow for compact radar apertures. 

One extremely popular commercial usage for high-frequency radars is for 
automotive applications with 76-77 GHz and 79-81 GHz allocated frequency 
bands. These automotive radars are being considered for a wide range of 
Advance Driver Assist Systems (ADAS), including (i) long-range (high- 
directivity, narrow forward looking beam) systems for applications such 
as adaptive cruise control, (ii) medium-range (medium directivity, beam- 
width) systems for applications such as cross-traffic alert, and (iii) short-range 
(direct proximity) systems for applications such as obstacle avoidance and 
parking assist. While such systems are already deployed at present [Has12], 
continuous improvement in technology would allow for a better performance 
(power consumption/noise figure), ultimately leading to a cost reduction and 
improved reliability. Similarly, 94 GHz constitutes another frequency band 
which is popular for aerospace and aviation radars, for application such 
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as for displaying runway image in poor weather conditions [Gos09], for 
airport ground control systems [Mar16], and weather-cloud investigations 
[Mar08]. 

Even higher frequencies allow for millimeter-range resolution. Such pre- 
cision radars can be used for industrial imaging, inspection, and automation. 
In addition, novel consumer applications such as gesture recognition [Arb13], 
[GSoli] have also started to emerge. 

Due to a similar coherent nature, the hardware advancements discussed 
for the communication chipsets also benefit the radar systems similarly. 
Within the frame of the DOTSEVEN project, a complete highly integrated 
FMCW homodyne monostatic radar system operating around 240 GHz witha 
60 GHz bandwidth and a state-of-the-art 2.57 mm range resolution was devel- 
oped [Statn15]. The radar module was used for 3-D imaging of cardboard 
boxes with a dynamic range of around 50 dB in the acquired range profiles. 
Key to the success of silicon technologies is a low-cost packaging scheme 
which needs to be developed to support low cost on the sub-component level. 
Hence, the DOTSEVEN radar chips were further packaged together with 
wideband lens-integrated on-chip annular-slot antenna [Grzyb15] and wire 
bonded onto a low-cost FR4 printed-circuit board. 


7.3.3 Imaging and Sensing 


Three-dimensional THz imaging based on the principle of computed tomog- 
raphy (THz-CT) is one of the emerging applications that may be explored 
in commercial imaging and sensing applications. THz-CT offers volumetric 
object reconstruction with an image contrast based on the characteristic THz 
absorption of the illuminated material. Since THz radiation is non-ionizing 
and thus requires no dedicated safety measures, THz-CT represents an inter- 
esting alternative to X-ray technology for low-cost industrial quality control. 
A THz-CT system solely based on components built in DOTSEVEN tech- 
nology was built and evaluated. A SiGe-HBT source [Hill15] was focused 
in the object plane and refocused to an NMOS [Jain16] detector with low- 
cost PTFE lenses. The 3D image was reconstructed from measurements at 
multiple projection angles and positions based on a filtered back-projection 
algorithm. The system offers a dynamic range of up to 60 dB at 490 GHz. 
The Gaussian beam waist sizes are 2.54 mm in the y-direction and 2.40 mm 
in the z-direction. These results show that Terahertz 3D CT imagers can be 
entirely implemented in a silicon process technology. 
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